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(54) AUDIO DATA STRUCTURE, RECORDING MEDIA, AND PROCESSOR 



(57) In audio contents which have cells for defining 
an audio title playback unit and whose actual playback 
sequence is determined by defining the playback 
sequence of the cells, celt information to specify the 
cells is provided with identification information to identify 
the types of the cells according to the difference in the 



contents of the data included in the cells. One type of 
the contents of the data in the ceils is for obtaining the 
length of the silent period of time. The identification 
information corresponding to the cell indicates a silent 
cell. 
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Description 

Technical Field 

[0001] This invention relates to an audio data struc- 5 
ture which facilities the handling of high sound-quality 
audio data in processing (recording, reproducing, trans- 
mitting, and constructing) the data and fulfills the high 
sound-quality requirement, a recording medium for the 
audio data structure, and an apparatus for processing 10 
its signal. 

Background Art 

[0002] DVD video disks are optical disk on which 15 
video (moving picture) information can be recorded very 
densely with high quality and further various types of 
information, including multiangle pictures, sub-pictures, 
multilingual voice, and multichannel audio, can be 
recorded. Such DVD video disks have been developed 20 
and on the market and are finding their way into wide 
application. (DVD is an abbreviation for Digital Versatile 
Disk.) 

[0003] The specifications for DVD video disks cover 
not only compressed multichannel audio {including AC- 
3 and MPEG) but also uncompressed linear PCM 
(including 48 kHz sampling, 16-bit quantization, 96 kHz 
sampling, and 24-bit quantization). The DVD video lin- 
ear PCM meets the high-frequency sampling, high- 
number bit, high sound quality specifications, which sur- 
pass those for conventional music CDs (with 44.1 kHz 
sampling and 16 bit quantization). Linear PCM with 96- 
kHz sampling and 20- to 24-bit quantization is suffi- 
ciently qualified for the next generation digital audio 
disks (what is called super CDs or super audio disks). 
[0004] However, the DVD video specifications have 
been determined by video requirements rather than 
audio requirements. In terms of not only sampling fre- 
quency and the number of quantization bits but also the 
number of recordable channels and recordable time, 
audio-oriented specifications surpassing the DVD video 
sound specifications have been expected. 
[0005] To meet the expectation, DVD audio specifica- 
tions have been studied (it should be noted that the DVD 
audio specifications have not been in the prior art yet). 
The DVD audio specifications have been considered to 
be capable of supporting linear PCM with 48- to 96-kHz 
sampling and 24-bit quantization employed in the DVD 
video specifications up to linear PCM with 192-kHz 
sampling and 24-bit quantization. Moreover, future ver- 
sions of the DVD audio specifications might introduce 
much higher sound quality. 

[0006] The reason why the DVD audio provides 
upward compatibility is that it has a part shared with 
DVD video that can record a large volume of data cover- 
ing high-definition television images. Moreover, the DVD 
audio is characterized by having technical, marketable, 
and economical advantages in the future when it can be 



used as a result of advance in the DVD video. 
[0007] For example, when high-capacity DVD disks to 
be put in practical use in future DVD video are used in 
DVD audio, if the recording time is constant, there is a 
possibility that the sampling frequency in recording, the 
number of quantization bits, and the number of record- 
ing channels will be increased more and more. In addi- 
tion, the technique for DVD video recorders using DVD- 
RAMs (or rewritable DVD-RW or write-once DVD-R) to 
be put in practical use in the near future can be used in 
DVD audio recorders to come in practice soon or later. 
[0008] Furthermore, as the popularization of DVD 
video expands its market, DVD video and DVD audio 
share increasingly more of the recording mediums 
(including DVD-ROM, disks, DVD-RAM/DVD-RW disks, 
and DVD-R disks), unit parts (including disk drives, opti- 
cal pickups, and various types of ICs), and various con- 
trol programs. This accelerates the cost reduction of 
DVD audio products featuring high sound quality and 
other advantages. When DVD audio is used widely, 
DVD video will enjoy the future technical, marketable, 
and economical advantages available as a result of 
advance in DVD audio. 

[0009] As described above, the development of DVD 
audio has been expected, but, as seen from the afore- 
mentioned DVD video, DVD audio with various functions 
and performances will possibly be proposed and devel- 
oped as a result of a high-density recording disk having 
been developed. Specifically, there is a possibility that 
DVD audio with a different data structure in terms of 
sampling frequency, the number of quantization bits, 
and the number of channels will be produced. Moreo- 
ver, DVD audio with a different data structure in terms of 
functions, such as DVD audio with or without menu 
images, or DVD audio with or without background 
images, will possibly be produced. 
[0010] Accordingly, an object of the present invention 
is to provide a data structure that enables audio 
attributes to be specified track by track. The data struc- 
ture makes it possible to allow the reproduction side to 
deal with DVD audio easily even if various functions and 
performances are included in the DVD audio. 
[0011] The reproduction side needs a preparation 
time for changing the hardware devices according to the 
change of the attributes. The preparation time causes a 
break in the sound output. Accordingly, another object 
of the present invention is to provide a data structure 
which positively recognizes a break in sound and allows 
the designer or producer to set a sound break period 
arbitrary. The data structure makes it possible to make 
silent periods between pieces of music constant when, 
for example, a DVD audio disk is played back, which 
provides the user with a stable playback condition. 

Disclosure of Invention 

[0012] To achieve the foregoing objects, identification 
information to identify the type of cells by the difference 
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in the contents of the data included in the cells is pro- 
vided in cell information to specify the cells in audio con- 
tents that have cells defining at least an audio title 
playback unit and determines the actual playback 
sequence by defining the playback sequence of the 5 
cells. This enables the data structure creator side to 
deliberately realize data processing management, tim- 
ing management, and setting management on the 
reproducing apparatus side during playback according 
to the contents of the data on the basis of the identifica- w 
tion information. 

[0013] One type of the contents of the data in the cells 
is for determining the length of a silent period of time. 
The identification information corresponding to the cell 
is characterized by indicating a silent cell. Providing a 15 
silent cell for determining the time of the silent period 
enables a silent period to be set. Using the silent period, 
the reproducing apparatus side can change or set the 
attributes. In a case where tracks with a break in sound 
are mingled with tracks without a break in sound, effec- 20 
tive use of silent cells at the head of a track without a 
break in sound makes it possible to edit the data in such 
a manner that a uniform pause period giving no unnatu- 
ral feeling on the whole is taken. 

25 

. Brief Description of Drawings 
[0014] 

FIG. 1 is a perspective view to help explain the con- 30 
figuration of an optical disk usable as a DVD audio 
recording medium; FIG. 2 is a view to help explain 
the correlation between the data recording area on 
the optical disk of FIG. 1 and recording tracks of 
data recorded in the area; FIG. 3 is a diagram to 35 
help explain the hierarchical structure of the infor- 
mation recorded in a DVD audio zone among the 
various types of information recorded on the optical 
disk of FIG. 2; FIG. 4 is a diagram to help explain an 
example of the data structure of AOTT_AOBS 40 
among the pieces of information recorded in the 
DVD audio zone of FIG. 3; FIG. 5 is a diagram to 
help explain the hierarchical structure of the infor- 
mation recorded in a DVD video zone among the 
various types of information recorded on the optical 45 
disk of FIG. 2; FIG. 6 is a diagram to help explain an 
example of the data structure of VTSTT.VOBS in 
the information recorded in the DVD video zone of 
FIG. 5; FIG. 7 is a diagram to help explain an exam- 
ple of video information (e.g., VTS_C #2) accessed so 
by both program chain information (ATS.PGCI) in 
the DVD audio zone of FIG. 3 and program chain 
information (VTS.PGCI) in the DVD video zone of 
FIG. 5; FIG. 8 is a diagram to help explain an exam- 
ple of the data structure that is about the recorded 55 
contents of user-accessible DVD audio and is 
recorded on one side of the optical disk shown in 
FIG. 1 ; FIG. 9 is a diagram to help explain an exam- 



ple of the directory structure of the information (in 
data files of DVD audio and DVD video) recorded 
on the optical disk of FIG. 1 ; FIG. 1 0 is a diagram to 
help explain another example of the directory struc- 
ture of the information (in data files of DVD audio 
and DVD video) recorded on the optical disk of FIG. 
1 ; FIG. 1 1 is a diagram to help explain a case where 
the directory on the audio content side accesses a 
file in the directory on the video content side in the 
directory structure shown in FIG. 9; FIG. 12 is a dia- 
gram to help explain a case where a file in the direc- 
tory on the audio content side links with a file in the 
directory on the video content side in the directory 
structure shown in FIG. 9; FIG. 13 is a diagram to 
help explain an example of how file accessing of 
FIG. 1 1 is effected in the volume spaces shown in 
FIG. 3 of 5; FIG. 14 is a diagram to help explain 
another example of how file accessing of FIG. 1 1 is 
effected in the volume space shown in FIG.*3 of 5; 
FIG. 15 is a diagram to help explain still another 
example of how file accessing of FIG. 1 1 is effected 
in the volume space shown in FIG. 3 or 5; FIG. 16 
is a diagram to help explain the recorded contents 
of audio manager information (AMGI) in the DVD 
audio zone shown in FIG. 3; FIG. 17 shows the 
recorded contents of the audio manager informa- 
tion management table (AMGLMAT) included in 
the audio manager information (AMGI) shown in 
FIG. 16; FIG. 18 is a diagram to help explain the 
contents of the audio title search pointer table 
(ATT_SRPT) included in the audio manager infor- 
mation (AMGI) shown in FIG. 16; FIG. 19 is a dia- 
gram to help explain the contents of the audio title 
search pointer (ATT_SRP) included in the audio 
title search pointer table (ATT_SRPT) shown in 
FIG. 18; FIG. 20 is a diagram to help explain the 
contents of the audio-only title search pointer table 
(AOTT_SRPT) included in the audio manager infor- 
mation (AMGI) shown in FIG. 16; FIG. 21 is a dia- 
gram to help explain the contents of the audio-only 
title search pointer (AOTT_SRP) included in the 
audio-only title search pointer table (AOTT.SRPT) 
shown in FIG. 20; FIG. 22 is a table showing the 
relationship between a group of audio-only titles 
(AOTT_GR) accessed using the audio-only title 
search pointer (AOTT_SRP) in the audio manager 
information (AMGI) shown in FIG. 16 and a group of 
audio titles (ATT_GR) accessed using the audio 
title search pointer (ATT_SRP) in the audio man- 
ager information (AMGI); FIG. 23 is a diagram to 
help explain the recorded contents of an audio title 
set (ATS) in the DVD audio zone shown in FIG. 3; 
FIG. 24 shows the recorded contents of the audio 
title set information management table (ATSLMAT) 
included in the audio title set information (ATSI) 
shown in FIG. 23; FIG. 25 is a diagram to help 
explain the contents of the audio title set program 
chain information table (ATS_PGCIT) included in 
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the audio title set information (ATSI) shown in FIG. 
23; FIG. 26 is a table showing the contents of the 
audio title set program information (ATS_PGI) 
shown in FIG. 25; FIG. 27 is a table showing the 
contents of the audio title set cell playback informa- 5 
tion (ATS_C_PBI) shown in FIG. 25; FIG. 28 is a 
diagram to help explain the contents of the audio 
title set audio still video playback information table 
(ATS_ASV_PBIT) shown in FIG. 25; FIG. 29 is a 
table showing the contents of the audio title set pro- w 
gram audio still video playback information search 
pointer (ATS_PG_ASV_PBI_SRP) shown in FIG. 
28; FIG. 30 is a block diagram of an apparatus for 
reproducing the recorded information in the DVD 
audio zone of FIG. 3 or the recorded information in 75 
the DVD video zone of FIG. 5 from the optical disk 
of FIG. 1; FIG. 31 is a front view of an example of 
the front panel of the reproducing apparatus of FIG. 
30; FIG. 32 describes the types of audio data cells 
forming the important part of the present invention; 20 
FIGS. 33A and 33B are diagrams to help explain 
the types of audio title set program and the data 
allocation structure; FIG. 34 is a diagram to help 
explain an example of audio-only data pack trains in 
the audio-only title; FIG. 35 is a diagram to help 2 5 
explain pack trains when audio-only title audio and 
real-time data are present; FIG. 36 is a diagram to 
help explain pack trains when audio-only title audio 
cells and a silent cell are present; FIGS. 37A to 37C 
are diagrams to help explain a pack train where pro- 30 
grams in front of and behind a program have the 
same attributes and are composed of only audio 
cells, and the change of the presentation time 
stamp and the change of the playback time caused 
by the playback sequence; FIGS. 38A to 38C are 35 
diagrams to help explain a pack train where pro- 
grams in front of and behind a program have differ- 
ent attributes and are composed of audio cells 
including silent cells, and the change of the presen- 
tation time stamp and the change of the playback 40 
time caused by the playback sequence; FIG. 39 is a 
block diagram of another example of the disk repro- 
ducing apparatus according to the present inven- 
tion; and FIG. 40 is a block diagram of still another 
example of the disk reproducing apparatus accord- 45 
ing to the present invention. 

Best Mode of Carrying Out the Invention 

[0015] Hereinafter, referring to the accompanying 50 
drawings, an embodiment of the present invention will 
be explained. This invention relates to an audio data 
structure which facilitates the handling of high sound 
quality audio data and assures the high sound quality in 
processing (recording, reproducing, transferring, and 55 
constructing) the high sound quality audio data, a 
recording medium thereof, a processing apparatus 
thereof, and a processing method thereof. 
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[001 6] In the embodiment, explanation will be given as 
to a case where the present invention is applied to a 
system where the objects of contents (including various 
video contents and various audio contents) are shared. 
In addition, explanation will be given as to a case where 
the invention is applied to an information recording 
medium with management data used to share the 
objects of contents, an apparatus for reproducing the 
recorded information from the medium, a method of 
recording information including the management data 
on the medium, and a method of reproducing the infor- 
mation from the medium on the basis of the manage- 
ment data. 

[0017] FIG. 1 is a perspective view showing the con- 
figuration of an optical disk 10 that can be used as a 
DVD audio recording medium. As shown in FIG. 1, the 
optical disk 10 is such that two transparent substrates 
14 on each of which a recording layer 17 is provided are 
laminated together with an adhesion layer 20. Each 
substrate 14 is made of 0.6-mm-thick polycarbonate. 
The adhesion layer 20 is made of very thin (for example, 
40-nm-thick) ultraviolet-curing resin. The two 0.6-mm- 
thick substrates 14 are laminated together in such a 
manner that the recording layer 17 of each substrate is 
in contact with one surface of the adhesion layer 20, 
which produces a 1.2-mm-thick large-capacity optical 
disk 10. 

[0018] In the optical disk 10, a central hole 22 is made. 
Around the central hole 22 on both sides of the optical 
disk 10, clamp areas 24 for clamping the optical disk 10 
during rotation are provided. When the optical disk 10 is 
loaded into a disk drive unit (not shown), the spindle of 
a disk motor is inserted in the central hole 22. While the 
optical disk 10 is rotating, the disk is clamped by disk 
clampers (not shown) in the clamp areas 24. 
[0019] The optical disk 10 has an information area 25 
around the clamp areas 24 in which video data, audio 
data, and other pieces of information can be recorded. 
[0020] In the information area 25, a lead-out area 26 
is provided at the outer edge and a lead-in area 27 is 
provided at the inner edge adjacent to the clamp area 
24. A data recording area 28 is defined between the 
lead-out area 26 and lead-in area 27. 
[0021] In the recording layer (light reflecting layer) 17 
of the information area 25, recording tracks are formed 
continuously, for example, in a spiral. The continuous 
tracks are divided into physical sectors. Serial numbers 
are allocated to the sectors. Using the sectors as 
recording units, various types of data are recorded on 
the optical disk 10. 

[0022] The data recording area 28 is an actual data 
recording area including a DVD audio data recording 
area and a DVD video data recording area (the DVD 
video data recording area might not be used in a pure 
audio disk). 

[0023] In the DVD audio data recording area, audio 
data is chiefly written as recording and playback infor- 
mation in the form of pit trains (or in a physical shape or 
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phase that optically changes the laser reflected light). 
Depending on the situation, still picture data may be 
recorded in the DVD audio data recording area. The 
audio data recorded in the DVD audio data recording 
area can include completely silent data (not a silent por- 
tion in music but intentionally silent data). 
[0024] On the other hand, in the DVD video data 
recording area, video data (main picture data) for mov- 
ies, sub-picture data for subtitles and menus, and audio 
data for words and sound effects are recorded as 
recording and playback information in the form of pit 
trains. 

[0025] When the optical disk 10 is a single-sided sin- 
gle layer, double-sided recording DVD-RAM disk (or a 
rewritable disk; DVD-RW disk), each recording layer 17 
is composed of a triple layer formed by sandwiching a 
phase change recording material (e.g., Ge 2 Sb 2 Te 5 ) 
between two zinc sulfide • silicon oxide mixtures 
(ZnS • Si0 2 ). 

[0026] When the optical disk 1 0 is a single-sided sin- 
gle layer, single-sided recording RAM disk, the record- 
ing layer 17 on the reading face 19 side is composed of 
a triple layer including the phase change recording 
material layer. In this case, the layer 17 located on the 
opposite side when viewed from the reading face 19 
side need not be an information recording layer. It may 
be a simple dummy layer. 

[0027] When the optical disk 10 is a single-sided read- 
ing dual-layer RAM/ROM disk, two recording layers 1 7 
are composed of a single phase change recording layer 
(the rear side viewed from the reading face 19; for read- 
ing) and a single translucent metal reflecting layer (the 
front side viewed from the reading face 19; for play- 
back). 

[0028] When the optical disk 10 is a write-once DVD- 
R, polycarbonate is used for a substrate. Gold may be 
used for a reflecting film (not shown) and an ultraviolet- 
curing resin may be used as a protective film (not 
shown). In this case, organic pigment is used for the 
recording layer 17. Cyanine, squarilium, chroconic, 
triphenylmethane dyes, xanthene, quinone dyes (e.g., 
naphthoquine or anthraquinone), and metal complex 
dyes (e.g., phthalocyanine, porphyrin, dithiol Complex, 
and the like) may be used as the organic pigment. 
[0029] Data can be written onto such a DVD-R disk 
using, for example, a semiconductor laser with an out- 
put of about 6 to 12 mW at a wavelength of 650 nm. 
[0030] When the optical disk 1 0 is a single-sided read- 
ing, dual-layer ROM disk, two recording layers 17 are 
composed of a single metal reflecting layer (at the back 
viewed from the reading face 19) and a translucent 
metal reflecting layer (at the front viewed from the read- 
ing face 19). 

[0031] In a read-only DVD-ROM disk (for DVD audio 
and/or DVD video), pit trains are formed by a stamper 
on a substrate 14 in advance. On the surface of the sub- 
strate 14 on which the pit trains have been formed, a 
reflecting layer of metal or the like is formed. The reflect- 



ing layer is used as the recording layer 17. In such a 
DVD-ROM disk, groups serving as recording tracks are 
normally not provided. Instead, the pit trains formed at 
the surface of the substrate 14 function as tracks. 

5 [0032] In the various types of optical disk 1 0, the play- 
back-only ROM information is recorded as an emboss 
signal in the recording layer 17. In contrast, an emboss 
signal is not recorded on the substrate 14 having the 
read/write (or write-once) recording layer 17. Instead, a 

10 continuos groove is inscribed. The groove is provided 
with phase change recording layers and others. In the 
case of a read/write DVD-RAM disk, phase change 
recording layers in the land portions as well as the 
groove are used for information recording. 

15 [0033] When the optical disk 10 is of the single-sided 
reading type (with either one or two recording layers), 
the substrate 14 on the reverse side viewed from the 
reading face t9 is not necessarily transparent to a 
read/write laser beam. In this case, a label may be 

20 printed on the whole surface of the reverse-side sub- 
strate 14. 

[0034] FIG. 2 is a diagram to help explain a correlation 
between the data recording area 28 on the optical disk 
10 of FIG. 1 and recording tracks of data items recorded 

25 there. When the optical disk 10 is a DVD-RAM (or DVD- 
RW), the body of the optical disk 10 is housed in a car- 
tridge (not shown) to protect the delicate disk surfaces. 
When the DVD-RAM disk together with the cartridge is 
inserted in the disk drive of a DVD player explained 

30 later, the optical disk 10 is drawn out of the cartridge 
and clamped to the turn table of a spindle motor (not 
shown). Then, the disk is rotated in such a manner that 
it faces an optical head (not shown). 
[0035] On the other hand, when the disk 10 is a DVD- 

35 R or a DVD-ROM, the body of the optical disk 10 is not 
housed in a cartridge. The naked optical disk 10 is set 
directly in the disk tray of the disk drive. 
[0036] On the recording layer 17 in the information 
area 25 of FIG. 1, data tracks are formed continuously 

40 in a spiral. As shown in FIG. 2, the continuos tracks are 
divided into logical sectors (the minimum recording 
unit), each sector having a specific storage capacity. 
Data is recorded in logical sectors. The storage capacity 
of one logical sector is set at 2048 bytes (or 2 kilobytes) 

45 equal to the data length of one pack. 

[0037] The data recording area 28 is an actual data 
recording area, in which management data and sound 
data have been recorded for DVD audio and similarly 
management data, main picture (video) data, sub-pic- 

50 ture data, and sound data have been recorded. 

[0038] Although not shown, when the optical disk 10 
of FIG. 2 is a DVD-RAM disk, the data recording area 28 
may be divided into ring-like (annual-ring-like) recording 
areas (recording zones). In this case, the angular speed 

55 of the disk rotation differs from one recording zone to 
another. In each zone, however, the linear speed or 
angular speed can be made constant. When the optical 
disk 10 of FIG. 2 is a DVD-ROM disk, various data items 
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are recorded on al! of the data recording area 28 at a 
constant linear speed. 

[0039] FIG. 3 is a diagram to help explain the hierar- 
chical structure of those recorded in the DVD audio 
zone among the various pieces of information recorded 
on the optical disk 10 of FIG. 2. In FIG. 3, the data 
recording area 28 formed on the optical disk 10 has a 
structure shown in the figure. A logical format in the 
structure is determined so as to comply with, for exam- 
ple, ISO 9660, one of the standards, and the universal 
disk format (UDF) bridge. 

[0040] The data recording area 28 between the lead- 
in area 27 and the lead-out area 26 is allocated as a vol- 
ume space 28. The volume space 28 can include a 
space (volume/file structure area 70) for information on 
volume and file structure, a space (DVD audio zone 71 
and DVD video zone 72) for applications complying with 
the DVD standard, and a space (other recording areas 
73) for applications other than those complying with the 
DVD standard. 

[0041] The volume space 28 is physically divided into 
a large number of sectors. Serial numbers are allocated 
to the physical sectors. The logical addresses for the 
data items recorded in the volume space 28 mean logi- 
cal sector numbers as determined in ISO 9660 and the 
UDF bridge. Like the effective data size of a physical 
sector, the size of a logical sector is set at 2048 bytes (2 
kilobytes). The logical sector numbers are serial in such 
a manner that they correspond to ascending order of 
physical sector number. 

[0042] Unlike a logical sector, a physical sector is 
given redundant information, such as error correction 
information. To be precise, the physical sector size 
therefore does not correspond to the logical sector size. 
[0043] As shown in FIG. 3, the volume space 28 
includes a volume/file structure area 70, a DVD audio 
zone 71, a DVD video zone 72, and other recording 
areas 73. These areas 70 to 73 are separated at the 
boundaries of logical sectors shown in FIG. 2. Here, one 
logical sector is defined as containing 2048 bytes. One 
logical block is defined as containing 2048 bytes. Con- 
sequently, one logical sector is defined in the same 
manner as one logical block. 

[0044] The volume/file structure area 70 corresponds 
to the management area determined in ISO 9660 and 
the UDF bridge. On the basis of the description in the 
area 70, the contents of the audio manager (AMG) 71 1 
are stored in the system memory in the DVD player 
explained later. 

[0045] The DVD audio zone 71 is composed of a sim- 
ple audio manager (SAMG) 710, an audio manager 
(AMG) 711, an audio still video set (ASVS) 712, and 
one or more audio title sets (ATS #m) 713 (the maxi- 
mum number of audio title sets m is 99). 
[0046] The SAMG 710 is a single file containing 128 
kilobytes, into which a simple audio play pointer table 
(SAPPT) with the same contents has been written eight 
times. 



[0047] The AMG 71 1 is composed of an audio man- 
ager information (AMGI) file 7110, an audio manager 
menu video object set (AMGM_VOBS) file 7111, and an 
audio manager information backup (AMGI_BUP) file 
5 7112. The AMGM.VOBS file 7111 is an optional file 
and may be absent. 

[0048] The ASVS 712 is composed of an audio still 
video set information (ASVSI) file 7210, an audio still 
video object set (ASVOBS) file 7121, and an audio still 

10 video set information backup (ASVSLBUP) file 7122. 
[0049] Each ATS 713 is composed of an audio title set 
information (ATSI) file 7130, an audio-only title audio 
object set (AOTT.AOBS) file 7131, and an audio title 
set information backup (ATSI.BUP) file 7132. Here, the 

15 AOTT.AOBS file 7131 is made up of one to nine files. It 
is an optional file and may be absent. 
[0050] Referring to FIG. 4, AOTT_AOBS 7131 will be 
explained. As explained later, AOTT.AOBS 7131 
defines a set of one or more audio objects (AOB). Each 

20 AOB defines a set of one or more audio title set cells 
(ATS_C #). A set of one or more cells constitutes an 
audio title set program. A set of one or more programs 
constitutes an audio title set program chain (PGC). 
[0051] In FIG. 3, the structure of AOTT_AOBS 7131 is 

25 represented directly by a set of ATS_C #. Each PGC is 
expressed by the program chain information in the ATS. 
[0052] When one PGC is compared to an opera, cells 
constituting the PGC correspond to various music 
scenes or singing senses in the opera. The contents of 

30 the PGC (or the contents of the cells) are determined by 
a software provider that creates the contents recorded 
on an optical disk 10. Specifically, the provider can 
reproduce the cells constituting AOTT.AOBS 7131 as it 
has planned, using cell playback information 

35 ATS_C„PBI written in program chain information 
ATS.PGCI in the ATS. Explanation of ATS_PGCI and 
ATS_C_PBI will be given later. 
[0053] In the other recording areas 73, usable pieces 
of information in the DVD video zone 72 or other pieces 

40 of information unrelated to the DVD video zone 72 can 
be written. The recording area 73 is not indispensable 
and may be eliminated, if unnecessary. 
[0054] FIG. 5 is a diagram to help explain the hierar- 
chical structure of those recorded in the DVD video 

45 zone 72 among the various pieces of information 
recorded on the optical disk 10 of FIG. 2. Hereinafter, 
what has been explained in FIG. 3 will be omitted and 
only the part related to the DVD video zone 72 will be 
explained. 

so [0055] On the basis of the description in the vol- 
ume/file structure area 70, the contents of the video 
manager (VMG) 721 are stored in the system memory 
in the DVD player explained later. 
[0056] The DVD video zone 72 is composed of a video 

55 manager (VMG) 721 and one or more video title sets 
(VTS #n) 722 (the maximum number "n" of video title 
sets is 99). 

[0057] The VMG 721 is composed of a video manager 
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information (VMGI) file 7210, a video manager menu 
video object set (VMGM_VOBS) file 7211, and a video 
manger information backup (VMGLBUP) file 7212. 
Here, the VMGM_VOBS file 721 1 is an optional file and 
may be absent. 

[0058] Each VTS 722 is composed of a video title set 
information (VTSI) file 7220, a video title set menu video 
object set (VTSM_VOBS) file 7221, a video title set title 
video object set (VTSTT_VOBS) file 7222, and a video 
title set information backup (VTSLBUP) file 7223. Here, 
the VTSM_VOBS file 7221 is an optional file and may 
be absent. 

[0059] Stored in each video title set (VTS) 722 are not 
only the video data (or video packs explained later) 
compressed according to the MPEG standard, the 
audio data (audio packs explained later) compressed or 
uncompressed according to a specific standard, and 
run-length-compressed sub-picture data (sub-picture 
packs explained later, including bit-map data whose one 
pixel is defined using plural bits), but also information 
used to reproduce these data items (navigation packs 
explained later, including presentation control informa- 
tion and data search information). 
[0060] Referring to FIG. 6, VTSTT.VOBS 7222 will be 
explained. As explained later, VTSTT.VOBS 7222 
defines a set of one or more video objects (VOB). Each 
VOB defines a set of one or more video title set cells 
(VTS_C #n). VTS_C #n is composed of one or more 
video object units (VOBU). A VOBU may include navi- 
gation packs, audio packs, and sub-picture packs. A set 
of one or more video title set cells (VTS_C #n) consti- 
tutes a video title set (VTS) program. A set of one or 
more programs constitutes a video title set (VTS) pro- 
gram chain (PGC). 

[0061] FIG. 5 shows the relationship between a pro- 
gram chain (PGC) and video title set cells (VTS_C #n). 
[0062] When one PGC is compared to a drama, cells 
constituting the PGC can be considered to correspond 
to various scenes in the drama. The contents of the 
PGC (or the contents of the cells) are determined by a 
software provider that creates the contents recorded on 
an optical disk 10. Specifically, as with the ATS.PGCl 
explained in FIG. 3, the provider can reproduce the cells 
constituting VTSTT.VOBS 7222 as it has planned, 
using the cell playback information (not shown) written 
in program chain information (VTS_PGCI) in the VTS. 
[0063] FIG. 7 is a diagram to help explain a case 
where specific pieces of video information (VTS_C #2, 
VTS.C #3, VTS_C #5) are accessed (in different meth- 
ods) by both the program chain information (ATS_PGCI) 
in the DVD audio zone 71 of FIG. 3 and the program 
chain information (VTS_PGCI) in the DVD video zone 
72 of FIG. 5. In other words, FIG. 7 shows a case where 
the same video objects (VOB) are referred to in different 
methods by the audio reproducing unit and video repro- 
ducing unit. 

[0064] Specifically, when video playback is carried out 
from the video title set (VTS) side, cells VTS.C #1 to 



VTS.C #6 in the VOB are reproduced in sequence on 
the basis of the cell playback information (not shown) in 
the VTS.PGCI. 

[0065] On the other hand, when video playback (or still 

5 playback) is carried out from the audio title set (ATS) 
side, cells VTS.C #2, VTS.C #3, and VTS.C #5 in the 
VOB are selectively reproduced on the basis of the cell 
playback information (ATS_C_PBI) in the ATS.PGCI. 
[0066] In this case, because neither the ATS nor the 

10 VTS needs to have the same cell data items (VTS.C 
#2, VTS_C #3, and VTS.C #5) separately on the same 
optical disk 10, it is possible to use the limited storage 
capacity of the optical disk 1 0 effectively. 
[0067] FIG. 4 shows an example of the data structure 

15 of the recorded contents (AOTT.AOBS) in the DVD 
audio zone 71 of FIG. 3. The AOTT.AOBS 7131 
explained in FIG. 3 defines a set of one or more audio 
objects (AOTT_AOB #) as shown in FIG. 4. Each 
AOTT.AOB defines a set of one or more audio title set 

20 cells (ATS_C #). A set of one or more cells (ATS_C #) 
constitutes a program. A set of one or more programs 
constitutes a program chain (PGC). This PGC consti- 
tutes a logical unit indicating the whole of or part of an 
audio title. 

25 [0068] In the example of FIG. 4, each audio title set 
cell (ATS_C #) is composed of a set of 2048-byte audio 
packs (A_PCK). These packs are the smallest units in 
performing a data transfer process. The smallest unit in 
logical processing is a cell. Logical processing is done 

30 in cells. 

[0069] FIG. 6 shows an example of the data structure 
of the recorded contents (VTSTT.VOBS) in the DVD 
video zone 72 of FIG. 5. 

[0070] As shown in FIG. 6, the VTSTT.VOBS 7222 

35 explained in FIG. 5 defines a set of one or more video 
objects (AOB #). Each VOB defines a set of one or more 
video title set cells (VTS.C #). Each VTS_C defines a 
set of one or more video object units (VOBU). A set of 
one or more video title set cells constitutes a program. A 

40 set of one or more programs constitutes a program 
chain (PGC). The PGC constitutes a logical unit indicat- 
ing the whole of or part of a video title or visual menu. 
[0071] As shown in FIG. 6, each VOBU is a collection 
(a pack train) of a navigation pack, video packs (MPEG- 

45 compressed moving picture data), sub-picture packs 
(run-length-compressed bit map data), and audio packs 
(uncompressed linear PCM audio data or compressed 
multichannel audio data), with the navigation pack at the 
head. Specifically, the video object unit (VOBU) is 

so defined as a collection of alt the packs starting from a 
navigation pack to the one just before the next naviga- 
tion pack. The navigation pack is incorporated in each 
VOBU to realize angle change (seamless angle change 
playback or nonseamless angle change playback). 

55 [0072] Those packs are used as the smallest units in 
transferring data as in FIG. 4. The smallest unit in logi- 
cal processing is a cell. Logical processing is done in 
cells. 
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[0073] The playback time of the VOBU corresponds to 
the playback time of the video data made up of one or 
more video groups (Groups of Pictures, abbreviated as 
GOPs) contained in the VOBU. The playback time is set 
in the range from 0.4 second to 1.2 seconds. In the 
MPEG standard, the playback time of one GOP is nor- 
mally about 0.5 second. One GOP contains screen data 
compressed so that about 15 pictures may be repro- 
duced in about 0.5 second. 

[0074] When the VOBU includes video data, GOPs 
(complying with the MPEG standard) composed of 
video packs, sub-picture packs, and audio packs are 
arranged to produce a video data stream. The VOBU is 
determined on the basis of the playback time of the 
GOPs, regardless of the number of GOPs. At the head 
of the VOBU, a navigation pack is always placed. 
[0075] In DVD video playback, even when the play- 
back data contains only audio and/or sub-picture data, it 
is constructed using a VOBU as one unit. For example, 
when a VOBU is made up of only audio packs, with a 
navigation pack at the head, the audio packs to be 
reproduced within the playback time (0.4 second to 1.2 
seconds) of the VOBU to which the audio data belongs 
are stored in the VOBU, as in the video data VOBU. 
[0076] As shown in FIG. 6, the VTSTT.VOBS is 
defined as a set of one or more VOBs. The VOBs in the 
VOBS are used for the same application. A menu VOBS 
is usually composed of one VOB, in which menu screen 
display data items are stored. In contrast, a video title 
set VOBS is usually composed of more than one VOB. 
[0077] When a concert video for a certain rock band 
is taken as an example, VOBs constituting a video 
object set (VTSTT_VOBS) for title sets can be consid- 
ered to correspond to the video data for the perform- 
ance of the band. In this case, by specifying particular 
VOBs, for example, the third piece on the band's con- 
cert program can be reproduced. 
[0078] In the VOBs constituting a video object set 
(VTSMJVOBS) for menus, the menu data for all the 
pieces of the band's concert program is stored. Accord- 
ing to the menu on the screen, a specific piece of music, 
for example, an encore, can be reproduced. 
[0079] In an ordinary video program, one VOBS can 
be composed of one VOB. In this case, one video 
stream is completed with a single VOB. 
[0080] On the other hand, for example, in the case of 
a collection of animations with multiple stories or omni- 
bus movies, plural video streams [plural program chains 
(PGCs)] can be provided for each story in one VOBS. In 
this case, each video stream is stored in the corre- 
sponding VOB. At that time, the audio stream and sub- 
picture stream related to each video stream are also 
completed in each VOB. 

[0081] Each video object (VOB) is assigned an identi- 
fication number (#i; i = 0 to i). By the identification 
number, the VOB can be identified. A VOB is composed 
of one or more cells. An ordinary video stream is made 
up of plural cells. A video stream for menus may be 



composed of one cell. Like the VOB, each cell is 
assigned an identification number (#j; j = 1 to j). 
[0082] FIG. 8 shows the recorded contents in the 
user-accessible DVD audio zone 71 to help explain an 

5 example of the data structure recorded on one side (of 
one or two layers) of the optical disk 10 shown in FIG. 1. 
[0083] In DVD audio, a hierarchical structure com- 
posed of albums, groups, tracks, and indexes is pre- 
pared as a management structure for recorded contents 

10 viewed from the software producer side. 

[0084] An album corresponds to one side of a DVD 
audio disk. For example, "The first volume of works by 
Beethoven" can be allocated to the album. In this case, 
the album may be composed of group #1 of Symphony 

15 No. 1 to group #9 of Symphony No. 9. 

[0085] Each group (e.g., group #1) is composed of the 
first to fourth movements of the corresponding sym- 
phony (Symphony No. 1). Each track is composed of 
indexes #1 to #i, which are obtained by dividing a track 

20 (e.g., track #1 ) into i pieces. 

[0086] When the user plays back a DVD audio disk 
with such a hierarchical structure as is shown in FIG. 8, 
the user sets the optical disk 10 in the DVD audio player, 
operates the remote controller (not shown), and selects 

25 group #1 and track #1 . 

[0087] After the selection, when the user presses the 
playback button on the remote controller, the DVD audio 
player starts to reproduce Beethoven's symphony No. 1 , 
starting at the first movement. When the user specifies 

30 a specific index from the remote controller, the specified 
index portion is searched for and playback is started at 
that portion. The first index part of the first track in the 
first group in the album can be reproduced in default, or 
even when the user specifies nothing. 

35 [0088] In playing back a DVD disk, the user can rec- 
ognize the title (such as, the title of a specific movie), 
whereas in playing back a DVD audio disk, the user can- 
not see the title. What the user can see are only the 
album, groups, tracks, and indexes shown in FIG. 8. 

40 [0089] FIG. 9 shows the directory structure of the 
information (DVD audio and DVD video data files) 
recorded on the optical disk 10 shown in FIG. 1. The 
structure is an example of a file directory structure 
defined in the DVD file standard. 

45 [0090] As in a hierarchical file structure used by a gen- 
eral-purpose computer operating system, a subdirec- 
tory of video title set (VTS), a subdirectory of audio title 
set (ATS), and a user-defined directory are connected 
to a root directory. 

so [0091] Specifically, in the subdirectory of video title set 
(VTS), various video files (including VMGI, VMGM, 
VTSI, VTSM, and VTS files) as shown in FIG. 5 are so 
arranged that the individual files can be managed in 
order. 

55 [0092] Moreover, in the subdirectory of audio title set 
(ATS), various audio files (including AMGI, ATSI, and 
ATS files) as shown in FIG. 3 are so arranged that the 
individual files can be managed in order. 
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[0093] The user can access a specific file (for exam- 
ple, a specific VTS or a specific ATS) by specifying a 
path from the root directory to the file. 
[0094] When a DVD video player produced according 
to the DVD video standard plays back a DVD video disk 
produced according to the DVD video standard, it first 
reads the management information (VMG) in the video 
title set (VTS) directory under the root directory and 
reproduces the video contents on the basis of the infor- 
mation. However, what can be reproduced according to 
the VMG is limited to the video contents (VTS) recorded 
in the VTS directory. 

[0095] On the other hand, when a DVD audio player 
(or a DVD video-DVD audio compatible player) pro- 
duced according to the DVD audio standard plays back 
a DVD audio disk produced according to the DVD audio 
standard, it first reads the management information 
(AMG) in the audio title set (ATS) directory under the 
root directory and reproduces the audio contents on the 
basis of the information. 

[0096] In this case, what can be reproduced according 
to the AMG is not limited to the audio contents (ATS) 
recorded in the ATS directory. The video contents (VTS) 
in the VTS directory can also be reproduced (the repro- 
ducing method will be explained later). 
[0097] FIG. 10 shows another example of the direc- 
tory structure of the information (DVD audio and DVD 
video data files) recorded on the optical disk 10 shown 
in FIG. 1. In the example of FIG. 9, the VTS directory 
and the ATS directory are placed in the same level of 
hierarchy under the root directory. On the other hand, in 
the example of FIG. 10, the ATS directory (child direc- 
tory) is placed in a level of hierarchy under the root 
directory (parent directory). The VTS directory (grand- 
child directory) is placed in a level of hierarchy under the 
ATS directory. 

[0098] FIG. 1 1 is a diagram to help explain the way the 
directory on the audio content side accesses a file in the 
directory on the video content side in the directory 
structure shown in FIG. 9. 

[0099] Specifically, in the hierarchical management 
structure for managing the data files recorded on the 
optical disk 10, the video title set directory (a child direc- 
tory) and the audio title set directory (a child directory) 
are placed under the root directory (a parent directory). 
[0100] The video title set directory (VTS directory) is 
a directory for dealing with the video content files 
recorded on the optical disk 10 and includes a video 
manger (VMG) file and one or more video title set (VTS) 
files (video content logical units) (see FIG. 5). 
[0101] The audio title set directory (ATS directory) is a 
directory for dealing with the audio content files 
recorded on the optical disk 10 and includes an audio 
manger (AMG) file and one or more audio title set (ATS) 
files (audio content logical units) as well as the afore- 
mentioned SAMG and ASVS (not shown in FIG. 11) 
(see FIG. 3). 

[0102] The VMG in the VTS directory manages only 



the VTS and is designed to access only the VTS in the 
VTS directory. 

[0103] On the other hand, the AMG in the ATS direc- 
tory manages mainly the ATS and is designed to access 
5 not only the ATS in the ATS directory but also the VTS 
in the VTS directory. 

[0104] The AMG includes audio manger information 
(AMGI). The AMGl includes an audio title search pointer 
table (ATT_SRPT). The ATT_SRPT includes an audio- 

10 only title (AOTT) search pointer (ATT_SRP) and an 
audio video (AVTT) search pointer (ATT_SRP). The 
contents of these will be explained in detail later. 
[0105] Specifically, the AMG in the ATS directory can 
access the audio title sets (ATS #1, ATS #2, ...) in the 

15 ATS directory using the AOTT search pointer 
ATT_SRPT. It can also access the video title sets (VTS 
#1, VTS #2, ...) in the VTS directory using the AVTT 
search pointer (ATT.SRPT). This enables a certain 
object (such as, VTS #1) to be shared by both the video 

20 contents and the audio contents. This is one of the 
important characteristics of "an object sharing system" 
according to the present invention. 
[0106] FIG. 12 is a diagram to help explain a case 
where a file in the directory on the audio content side 

25 links with a file in the directory on the video content side. 
FIG. 12 can be considered to be a modification of FIG. 
11. 

[0107] Specifically, in the example of FIG. 11, the 
audio manager (AMG) is designed to be able to access 
30 both an audio title set (ATS) and a video title set (VTS). 
This enables a VTS to be shared by the video contents 
and audio contents. 

[0108] On the other hand, in the example of FIG. 12, 
information (e.g., a pointer indicating an address for a 

35 specific part of VTS #1) to link with a video title set 
(here, VTS #1) is written in an audio title set (here, ATS 
#1 ). This enables, for example, the audio data in VTS #1 
to be shared by the video contents and audio contents. 
[0109] FIG. 1 3 shows a data structure to help explain 

40 an example of how file access in FIG. 1 1 is carried out 
in the volume space 28 shown in FIG. 3 or 5. The data 
structure of FIG. 13 corresponds to the directory struc- 
ture of FIG. 11. 

[0110] In FIG. 13, the shaded portions indicate exam- 
45 pies of the contents shared by the video contents (or 
video volume) and the audio contents (or audio vol- 
ume). 

[0111] The basic idea of the data structure of FIG. 13 
is to record a recording area (VMG + VTS) for video 

50 contents and a recording area (AMG + ATS) for audio 
contents in the volume space 28 independently and 
enable video contents shared by both video and audio 
uses to be managed by the AMG. 
[0112] Specifically, in FIG. 13, the video title set (VTS 

55 #1 ) managed by the VMG can access part (cells) of the 
video object set (VOBS #1) and the audio title set (ATS 
#1) managed by the AMG can access the other part 
(cells) of VOBS #1 . In this example, part of the cells con- 
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stituting the video object set (VOBS #1) of VTS #1 are 
shared by the video contents and audio contents. 
[01 1 3] In the data structure of FIG. 1 3, the DVD audio 
zone 71 is placed in locations with lower addresses 
(closer to the lead-in area 27 in FIG. 3). The DVD video 
zone 72 is placed in locations with higher addresses 
(closer to the lead-out area 26 in FIG. 3). In this case, 
the AMG has only to always use increasing addresses, 
when it accesses either ATS or VTS, and need not deal 
with decreasing addresses. This facilitate the construc- 
tion of the reproducing system. 
[0114] FIG. 14 shows a data structure to help explain 
another example of how file access in FIG. 1 1 is carried 
out in the volume space 28 shown in FIG. 3 or 5. 
Namely, FIG. 14 can be considered to be a modification 
of FIG, 13. 

[0115] In FIG. 13, because the DVD audio zone 71 is 
placed in locations with lower addresses and the DVD 
video zone 72 is placed in locations with higher 
addresses, the AMG need not handle decreasing 
addresses. 

[0116] On the other hand, in FIG. 14, the DVD video 
zone 72 is placed in locations with lower addresses 
(closer to the lead-in area 27 in FIG. 3). The DVD audio 
zone 71 is placed in locations with higher addresses 
(closer to the lead-out area 26 in FIG. 3). In this case, 
the AMG uses increasing addresses when it accesses 
the ATS and decreasing addresses when it accesses 
the VTS. In this case, addressing in accessing a desired 
object (a cell in the ATS or VTS) is troublesome. There- 
fore, it is difficult to apply the technique to commercial- 
use DVD audio players for which product costs are a 
problem. 

[0117] However, when a personal computer with a 
DVD drive is turned into a DVD audio player, the cost 
problem can be avoided even when the data structure of 
FIG. 14 has been employed. Specifically, the operating 
system (or the control software) of a personal computer 
whose data structure of FIG. 14 has been analyzed can 
convert the data structure with the physical arrange- 
ment of FIG. 14 into the arrangement of FIG. 13 in 
appearance by remapping the addresses on its mem- 
ory. By doing this, the MPU or CPU of the personal com- 
puter can access either ATS or VTS from the AMG by 
specifying increasing addresses. 
[0118] FIG. 15 shows a data structure to help explain 
still another example of how file access in FIG. 1 1 is car- 
ried out in the volume space 28 shown in FIG. 3 or 5. 
Namely, FIG. 15 can be considered to be a modification 
of FIG. 13. 

[0119] In FIG. 13, because the DVD audio zone 71 is 
placed in locations with lower addresses and the DVD 
video zone 72 is placed in locations with higher 
addresses, the AMG need not handle decreasing 
addresses. 

[0120] In contrast, in the data structure of FIG. 1 5, the 
AMG in the DVD audio zone 71 is placed in locations 
with lower addresses (closer to the lead-in area 27 in 



FIG. 3). The VMG in the DVD video zone 72 is placed in 
locations with higher addresses (closer to the lead-out 
area 26 in FIG. 3). In this case, the AMG has only to 
always use increasing addresses, when it accesses 

5 either the ATS and the VTS, and need not deal with 
decreasing addresses. Therefore, as in FIG. 13, it is 
easy to construct a reproducing system. 
[0121] Because the data structure of FIG. 15 is a 
nested structure where VTS #1 is placed in ATS #1, the 

w VMG of FIG. 5 cannot recognize that the VTS in the ATS 
exists in the DVD video zone 72. In this case, the VMG 
can treat the VTS in the ATS as existing in the other 
recording areas 73. 

[0122] The data structure of FIG. 15 can be used 
15 when the other recording areas 73 are used in a case 
where the AMG is allowed to access not only the ATS 
but also the VTS. 

[01 23] Three examples of the data structure that ena- 
bles the AMG to access not only the ATS but also VTS 
20 have shown in FIGS. 13 to 15. The most favorable one 
is the data structure of FIG. 13. The reason is that a 
desired common object can be accessed by just speci- 
fying increasing addresses without remapping the 
addresses. 

25 [0124] FIG. 16 is a diagram to help explain the 
recorded contents of the audio manager information 
(AMG I) in the DVD audio zone 71 shown in FIG. 3. 
[0125] The contents the DVD audio zone 71 deals with 
include two types of titles: an audio-only title (AOTT) 

30 and a video-added audio title [or audio-video title 
(AVTT)]. 

[0126] The AOTT is a title in an audio disk (A disk) 
without a video section and is defined in the ATS 
recorded under the audio title set directory. On the other 

35 hand, the AVTT is a title in an audio-video disk (AV disk) 
with a video section and is defined in the VTS recorded 
under the video title set directory. The AOTT and AVTT 
are generally called ATT (audio title). 
[0127] The DVD audio zone 71 in which the ATT data 

40 is recorded is composed of SAMG 710, AMG 711, 
ASVS 712, and one or more (up to 99) audio title sets 
(ATS #1 to ATS #m)713. 

[0128] The AMG 71 1 is composed of an audio man- 
ager information (AMGI) file 7110, an audio manger 
45 menu video object set (AMGM_VOBS) file (optional file) 
7111, and an audio manger intonation backup 
(AMGI_BUP)file7112. 

[0129] The AMGI file 7110 includes an audio manager 
information management table (AMGLMAT), an audio 

so title search pointer table (ATT_SRPT), an audio-only 
title search pointer table (AOTT_SRPT), an audio man- 
ger menu program chain information unit table 
(AMGM_PGCI_UT), and an audio text data manager 
(ATXTDT_MG). 

55 [0130] Specifically, the AMG 711 has two pieces of 
search information (ATT.SRPT) and (AOTT_SRPT). 
Here, the ATT_SRPT is a table in which search informa- 
tion for both the AOTT and the AVTT has been written. 
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The AOTT_SRPT is a table in which search information 
only for the AOTT has been written. 
[0131] The reason why the search information is not 
divided into AVTT search information and AOTT search 
information, but into ATT (the generic name for AOTT 
and AVTT) search information (ATT.SRPT explained 
later) and AOTT search information (AOTT_SRPT 
explained later) is to simplify the reproducing method for 
various types of DVD players. 

[0132] FIG. 17 shows the recorded contents of the 
audio manager information management table 
(AMGLMAT) included in the audio manager information 
shown in FIG. 16. 

[0133] The audio manger information management 
table (AMGLMAT) includes an audio manager identifier 
(AMGJD), an audio manager end address (AMG_EA), 
an audio manager information end address (AMGI_EA), 
the version number (VERN) of the standard employed 
by the optical disk (DVD audio disk) 1 0, a volume setting 
identifier (VLMSJD), autoplay information (APJNF), an 
audio still video set start address (ASVS.SA), the 
number of title sets (TS_Ns), a provider (software pro- 
ducer and/or seller) identifier (PVRJD), an audio man- 
ager information management table end address 
(AMGI_MAT_EA), an audio manager menu video object 
set start address (AMGM_VOBS_SA), an audio title 
search pointer table start address (ATT.SRPT.SA), an 
audio-only title search pointer table start address 
(AOTT_SRPT_SA), an audio manger menu program 
chain information unit table start address 
(AMGM.PGCI.UT.SA), an audio text data manager 
start address ( ATXTDT_ M G_S A) , a video attribute 
(AMGM_V_ATR) for an audio manager menu video 
object set, the number of sub-picture streams 
(AMGM_SPST_Ns) for an audio manager menu, an 
attribute (AMGM_SPST_ATR) for sub-pictures of an 
audio manager menu video object set, the number of 
audio streams (AMGM_AST_Ns) for an audio manager 
menu, an audio attribute (AM G M_AST_ATR) for an 
audio manager menu video object set, and other reser- 
vation areas. 

[0134] In the audio manager menu video object set 
start address (AMGM.VOBS.SA), the start address of 
the AMGM_VOBS is written in the relative number of 
blocks counted from the first logical block in the AMG. 
When no AMGM_VOBS is present, "00000000 (h) n is 
written in the AMGM_VOBS_SA. 
[0135] In the start address (ATT.SRPT.SA), the start 
address of the ATT.SRPT is written in the relative 
number of blocks counted from the first logical block in 
theAMGI. 

[0136] In the start address (AOTT_SRPT_SA), the 
start address of the AOTT_SRPT is written in the rela- 
tive number of blocks counted from the first logical block 
in the AMGI. 

[0137] From the ATT_SRPT_SA or AOTT_SRPT_SA 
written in the AMGLMAT of FIG. 17, it can be found on 
which part of the optical disk 10 the search pointer to 



the audio title (ATT.SRPT) or the search pointer to the 
audio-only title (AOTT_SRPT) has been written. 
[0138] FIG. 18 is a diagram to help explain the con- 
tents of the audio title search pointer table (ATT.SRPT) 

5 included in the audio manager information of FIG. 16. 
The AMGI has two types of search pointers 
(ATT_SRPT) and (AOTT.SRPT). FIG. 18 shows a 
search pointer (ATT.SRP) that can access not only 
AOTT but also AVTT. 

10 [0139] Specifically, the ATT.SRPT included in the 
AMGI includes audio title search pointer table informa- 
tion (ATT.SRPTI) and one or more audio title search 
pointers [ATT.SRP (ATT.SRP #1 to ATT_SRP #n)]. 
The ATT.SRPTI includes the number of audio title 

15 search pointers and the ATT.SRPT end address. 

[0140] FIG. 19 is a diagram to help explain the con- 
tents of each audio title search pointer (here, ATT_SRP 
#n) included in the search pointer table (ATT.SRPT) of 
the audio title shown in FIG. 18. 

20 [0141] The DVD audio standard has been determined 
so as to deal with not only sound but also images. The 
AMG has two pieces of search information (ATT_SRPT) 
and (AOTT.SRPT). The ATT.SRPT of FIG. 19 is a 
table in which both AOTT search information and AVTT 

25 search information have been written. 

[0142] In FIG. 19, the audio-only title (AOTT) audio 
title search pointer (ATT_SRP) includes an audio title 
(ATT) category, the number of programs in the audio 
title (ATT), reservation, the total playback time of the 

30 audio title (ATT), the number of the audio title set (ATS), 
the title number of the audio title set (ATS), and the start 
address of the audio title set (ATS). 
[0143] The video-added audio title (AVTT) search 
pointer (ATT.SRP) includes an audio title (ATT) cate- 

35 gory, the number of programs in the audio title (ATT), 
the number of angles included in video, reservation, the 
total playback time of the audio title (ATT), the number 
of the video title set (VTS), the title number of the video 
title set (VTS), and the start address of the video title set 

40 (VTS). 

[0144] FIG. 20 is a diagram to help explain the con- 
tents of the audio-only title search pointer table 
(AOTT.SRPT) included in the audio manager informa- 
tion (AMGI) shown in FIG. 16. The AMGI has two types 
45 of search pointers (ATT.SRPT) and (AOTT.SRPT). 
FIG. 20 shows a search pointer (AOTT.SRP) that can 
access only the AOTT. 

[0145] Specifically, the AOTT.SRPT included in the 
AMGI includes audio-only title search pointer table infor- 

50 mation (AOTT.SRPTI) and one or more audio-only title 
search pointers [AOTT.SRP (AOTT.SRP #1 to 
AOTT.SRP #m)]. The AOTT.SRPTI includes the 
number of audio-only title search pointers and the end 
address of the AOTT.SRPT. 

55 [0146] FIG. 21 is a diagram to help explain the con- 
tents of an audio-only title search pointer (here, 
AOTT.SRP #m) included in the search pointer table 
(AOTT.SRPT) of the audio-only title shown in FIG. 19. 
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[0147] The DVD audio standard has been determined 
so as to deal with not only sound but also images. The 
AMG has two pieces of search information (ATT_SRPT) 
and (AOTT.SRPT). The AOTT_SRPT of FIG. 21 is a 
table in which only AOTT search information has been 
written. 

[0148] In FIG. 21, the audio-only title search pointer 
(AOTT_SRP) includes an audio title (ATT) category, the 
number of programs in the audio-only title (AOTT), res- 
ervation, the total playback time of the audio-only title 
(AOTT), the number of the audio title set (ATS), the title 
number of the audio title set (ATS), and the start 
address of the audio title set (ATS). 
[0149] In the playback title control information defined 
in the audio manger (AMG), a title group (TT_GR) can 
be specified. 

[0150] The title group (TT_GR) is a collection of one 
or more audio titles (ATT) and is defined as a unit that 
assures continuous playback of ATT groups. From the 
user's viewpoint, the audio title (ATT) corresponds to a 
piece of music and the title group (TT_GR) corresponds 
to an album, a collection of pieces (see FIG. 8). With a 
record or a CD, when playback is started at the head of 
the album or in the middle of a piece, the album can be 
played back continuously until the end of the album has 
been reached. Similarly, when playback is started at the 
head of the TT_GR or in the middle of the ATT, the 
TT_GR can be played back continuously until the end of 
the TT_GR has been reached. 
[0151] The following two types can be defined as a 
title group (TT_GR). 

( A1 > Audio title group (ATT_GR): this ATT_GR is a 
title group (TT_GR) composed of audio titles (ATT) 
defined in the audio title search pointer table 
(ATT.SRPT). 

<A2> Audio-only title group (AOTT.GR): this 
AOTT_GR is a title group (TT_GR) composed of 
audio-only titles (AOTT) defined in the audio-only 
title search pointer table (AOTT.SRPT). 

[0152] The audio title group (ATT_GR) is for a player 
that can reproduce images and sound complying with 
the audio standard (a player that handles both AOTT 
and AVTT). The audio-only title group (AOTT_GR) is for 
a player that can reproduce only sound conforming to 
the audio standard (a player that deals with only AOTT). 
[0153] The structure of the audio title (ATT) has the 
following three types: ~ 

(B1> ATT has only AOTT. 
(B2> ATT has only AVTT. 
<B3> ATT has both AOTT and AVTT. 

[0154] Here, AOTT and AVTT are the same in a piece 
of music, but AOTT is an pictureless version and AVTT 
is a picture-added version. 

[0155] In the case of ( B1 > , the AOTT search informa- 



tion is written in both of the ATT_SRPT and 
AOTT.SRPT (see FIGS. 19 and 21). 
[01 56] In the case of < B2 > , the AVTT search informa- 
tion is written only in the ATT_SRPT (see FIG. 19). 
5 [0157] In the case of ( B3 ) , the AOTT search informa- 
tion is written only in the AOTT_SRPT and the AVTT 
search information is written only in the ATT_SRPT (see 
FIG. 19). 

[0158] FIG. 22 shows the relationship between ( B1 ) , 
10 <B2), and <B3). FIG. 22 shows the relationship 
between the audio-only title group (AOTT.GR) 
accessed using the audio-only title search pointer 
(AOTT_SRP) in the audio manager information (AMGI) 
of FIG. 1 6 and the audio title group (ATT_GR) accessed 
is using the audio title search pointer (ATT_SRP) in the 
audio manager information (AMGI). It can be said that 
FIG. 22 shows an example of the relationship between 
ATT_SRPT and AOTT.SRPT. 

[0159] In FIG. 22, the audio titles (ATT #1) and (ATT 

20 #9) are each composed of only video-added audio titles 
(AVTT). ATT #2 and ATT #8 are each composed of 
video-added audio titles (AVTT) and audio-only titles 
(AOTT). ATT #3 to ATT #7 are each composed of audio- 
only titles (AOTT). 

25 [0160] In FIG. 22, nine audio titles (ATT) are used. 
These are divided into four groups (GR #1 to GR #4), 
which constitute an audio title group (ATT_GR). The 
nine titles are divided into two groups (GR #1, GR #2), 
which constitute an audio-only title group (AOTT_GR). 

30 [0161] In this example, the audio titles (ATT #1) and 
(ATT #9) are composed of only AVTT and include no 
AOTT. Consequently, ATT #1 and ATT #9 do not exist as 
the audio-only title group (AOTT_GR). 
[0162] Therefore, the number (four in the example) of 

35 audio title groups (ATT_GR) generally does not coincide 
with the number (two in the example) of audio-only title 
groups (AOTT_GR). 

[01 63] What is important here is to keep the identity of 
the title group (TT_GR) in both cases where ATT groups 

40 are reproduced on a player capable of reproducing 
images and sound complying with the audio standard 
(or a player that deals with both AOTT and AVTT) and 
where ATT group are reproduced on a player capable of 
reproducing only sound conforming to the audio stand- 

45 ard (a player that deals with only AOTT). 

[0164] Specifically, the corresponding ATT_GR and 
AOTT_GR have to be composed of the same ATT even 
when they differ in the GR number and have the same 
order of ATT in the TTJ3R. Otherwise, the user gets 

so confused. This does not apply to the ATT (ATT #1 and 
ATT #9 in FIG. 22) where only AVTT is present and no 
AOTT exists. 

[0165] To meet the above-described requirements, 
restrictions should be placed in such a manner that 
55 "ATT not defined as AOTT" is prevented from mixing 
with "ATT defined as AOTT" in a one ATT_GR. This 
maintains the identity of TT_GR in a portion where both 
ATT_GR and AOTT_GR exist. 
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[0166] In the example of FIG. 22, each of ATT.GR #2 
and AOTT_GR #1, and ATT_GR #3 and AOTT.GR #2 
is composed of the same ATT and has the same order 
of ATT in the TT_GR. 

[0167] FIG. 23 is a diagram to help explain the 
recorded contents of the audio title set (ATS) in the DVD 
audio zone 71 of FIG. 3. 

[0168] The audio title set (ATS) is composed of audio 
title set information (ATSI), an audio-only title audio 
object set (AOTT_AOBS), and audio title set information 
backup (ATSLBUP). 

[0169] The audio title set information (ATSI) includes 
an audio title set information management table 
(ATS l_ MAT) and an audio title set program chain infor- 
mation table (ATS_PGCIT). 

[0170] The audio title set program chain information 
table (ATS.PGCIT) includes audio title set program 
chain information table information (ATS_PGCITI) t an 
audio title set program chain information search pointer 
(ATS_PGCI_SRP), and one and more pieces of audio 
title set program chain information (ATS.PGCI). 
[0171] FIG. 24 is a diagram to help explain the 
recorded contents of the audio title set information man- 
agement table (ATS l_ MAT) of FIG. 23. 
[01 72] Specifically, the audio title set information man- 
agement table (ATSLMAT) includes an audio title set 
identifier (ATSJD), the end address of the audio title set 
(ATS„EA), the end address of the audio title set infor- 
mation (ATS LEA), the version number (VERN) of the 
employed audio standard, the end address of the 
audio title set information management table 
(ATSI_MAT_EA), the start address (VTS_SA) of the 
audio-only title AOTT video title set (VTS), the start 
address of the audio-only title audio object set 
(AOTT_AOBS_SA) or the start address of audio-only 
title video object set (AOTT_VOBS_SA), the start 
address of the audio title set program chain information 
table (ATS_PGCIT_SA), the attributes for the audio-only 
title audio object (AOTT_ AO B_ ATR) or the attributes for 
the audio-only title video object ( AOTT_ VO B_ ATR) #0 
to #7, audio title set data mix coefficients 
(ATS_DM_COEFT) #0 to #15, and other reservation 
areas. 

[0173] When the ATS has no AOTT.AOBS, the start 
address of the VTS including the VTSTT_VOBS (see 
FIG. 6) for AOTT is written in the start address 
(VTS_SA) of the audio-only video title set. When the 
ATS has AOTT_AOBS, "OOOOOOOOh" is written in the 
VTS.SA. 

[0174] When the ATS has AOTT_AOBS, the start 
address of AOTT_AOBS is written in the 
AOTT_AOBS_SA in the relative number of logical 
blocks counted from the first logical block in the ATS. On 
the other hand, when the ATS has no AOTT_AOBS, the 
start address of VTSTT_VOBS is written in the 
AOTT_VOBS_SA in the relative number of blocks 
counted from the first logical block in the VTS including 
the VTSTT_VOBS used for the ATS. 



[0175] In the ATS_PGCIT_SA, the start address of 
ATS_PGCIT is written in the relative number of blocks 
counted from the first logical block in the ATSI. 
[0176] The number of the aforementioned 

5 AOTT_ AO B_ ATR or AOTT_ VO B_ ART prepared is 8, 
from #0 to #7. When the ATS has AOTT_AOBS, the 
attribute of the AOTT_AOB recorded in the ATS is writ- 
ten in the AOTT_AOB_ATR. On the other hand, when 
the ATS has no AOTT_AOBS, the attribute of the audio 

10 stream in the VOB used in the AOTT_VOB in the ATS is 
written in the AOTT_VOB_ART. In the AOTT_AOB_ATR 
or AOTT_VOB_ART, the employed sampling frequency 
(44 to 192 kHz) and the number of quantization bits (16 
to 24 bits) are written. 

15 [0177] The ATS_DM_COEFT indicates coefficients 
used to mix down the audio data with a multichannel 
output (5.1 channel output) to a two channel output and 
is used only in one or more AOTT_AOB recorded in the 
ATS. When the ATS has no AOTT.AOBS, "Oh" is written 

20 in all the bits in each of the 16 ATS_DM_COEFT (#0 to 
#15). The area for the 16 ATS_DM_COEFT (#0 to #15) 
are provided constantly. 

[0178] FIG. 25 is a diagram to help explain the eon- 
tents of the audio title set program chain information 
25 table (ATS_PGCIT) included in the audio title set infor- 
mation (ATSI) shown in FIG. 23. The recording position 
of the ATS_PGCIT is written in the ATS_PGCIT_SA in 
the ATSLMAT of FIG. 24. 

[0179] As described earlier, the ATS_PGCIT includes 
30 audio title set program chain information table informa- 
tion (ATS_PGCITI), an audio title set program chain 
information search pointer (ATS_PGCI_SRP), and 
audio title set program chain information (ATS_PGCI). 
[0180] The ATS_PGCI_SRP includes one or more 
35 audio title set program chain information search point- 
ers (ATS_PGCI_SRP #1 to ATS_PGCI_SRP #j). The 
ATS_PGCI includes as many pieces of audio title set 
program chain information (ATS_PGCI #1 to ATS.PGCI 
#j) as ATS_PGCI_SRP. 
40 [0181] Each ATS_PGCI functions as navigation data 
for controlling the reproduction of the audio title set pro- 
gram chain (ATS.PGC). 

[0182] Here, the ATS.PGC is a unit to define an 
audio-only title (AOTT) and is composed of ATS_PGCI 

45 and one or more cells (cells in the AOTT_AOBS or cells 
in the AOTT_VOBS used as an object for the AOTT). 
[0183] Each ATS_PGCI includes general information 
on audio title set program chains (ATS_PGC_GI), an 
audio title set program information table (ATS_PGIT), 

50 an audio title set cell playback information table 
(ATS_C_PBIT), and an audio title set audio still video 
playback information table (ATS_ASV_PBIT). 
[01 84] The ATS_PGIT includes one or more pieces of 
audio title set program information (ATS_PGI #1 to 

55 ATS_PGI #k). The ATS_C_PBIT includes as many 
pieces of audio title set cell playback information 
(ATS_C_PBI #1 to ATS_C_PBI #k) as the ATS_PGI. 
[0185] FIG. 26 is a table showing the contents of the 
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audio title set program information (ATS_PGI) shown in 
FIG. 25. The ATS_PGI includes the contents of the 
audio title set program (ATS_PG_CNT), the entry cell 
number of ATS.PG (ATS_PG_EN_CN), the playback 
start time of the first audio cell in ATS_PG 
(FAC_ST_PTM), the playback time of ATS_PG 
(ATS_PG_PB_TM), the pause time of ATS.PG 
(ATS_PG_PA_TM), reservation for copy management 
information, and other reservations. 
[0186] The ATS_PG_CNT includes a description 
showing the relationship between the preceding pro- 
gram and the present program in terms of physical allo- 
cation, a description showing the relationship between 
the preceding program and the present program in 
terms of playback time stamp, a description (ATRN) 
showing the attribute of AOB or the attribute of the audio 
stream in the VOB in the ATS_PG, and a description 
(DM_COEFTN) showing a coefficient table number for 
effecting the down-mixing of an AOB in the ATS_PG 
(AOB.PG) having the number of AOTT_ AO B_ ART or 
AOTT_VOB_ART defined in the ATS l_ MAT by using the 
number of ATS_DM_COEFT defined in ATSLMAT 
[01 87] The ATS_PG_EN_CN includes a description of 
the first ATS cell number (1 to 255) constituting an 
ATS.PG. 

[0188] The FAC_SA_PTM includes a description of 
the low-order 32 bits in the playback time stamp (or 
presentation time stamp PTS) written in the head audio 
packet in the first audio cell in the ATS_PG. 
[0189] The ATS_PG_PB_TM is a description of the 
total playback time of each cell in the ATS.PG. The total 
playback time (seconds) is obtained by dividing 
ATS_PG_PB_TM (32-bit data) by 90000. 
[0190] The ATS_PG_PA_TM is a description of the 
pause time able to be defined at the beginning of the 
ATS_PG. The pause time (seconds) is obtained by 
dividing ATS_PG_PA_TM (32-bit data) by 90000. 
[0191] FIG. 27 is a table showing the contents of the 
audio title set cell playback information (ATS_C_PBI) 
shown in FIG. 25. The ATS_C_PBI includes the index 
number (ATS_C JXN) of a cell in the audio title set, the 
type of ATS_C (ATS_C_TY), the start address of ATS_C 
(ATS_C_SA), the end address of ATS_C (ATS_C_EA), 
and other reservations. 

[0192] When the ATT has no AOBS, "01 h" is written in 
the ATS_C_IXN. When the ATT has AOBS, the contents 
of ATS_C_IXN are as follows according to the contents 
of the ATT_C. 

[0193] * When ATS C is a silent cell described earlier, 
"00h" is written in ATS_C_IXN as the index number for 
the ATS_C. 

[0194] * When ATS C is an audio cell described ear- 
lier, one of "1" to "99" is written in ATS_C_IXN as the 
index number for the ATS_C. 

[0195] The index number of the first audio cell (the 
one having the ATS_C with the lowest number exclud- 
ing the silent cell) is set at "I." A similar index number 
may be allocated to one or more ATS_C in the ATS.PG. 



[0196] When the ATT has no AOBS, "0" is written in all 
the bits in the ATS_C_TY. On the other hand, when the 
ATT has AOBS, the structure of ATT_C 
(ATS_C_COMP) and its usage (ATS_C_Usage) are 
5 written in the ATS_C_TY. 

[0197] Specifically, when the cell is an audio cell com- 
posed only of audio data, "00b" is written in 
ATS_C_COMP (2 bits). 

[0198] When the cell is an audio cell composed of 
w audio data and real-time information, "01b" is written in 
ATS_C_COMP (2 bits). 

[0199] When the cell is a silent cell composed only of 
silent audio data, "10b" is written in ATS_C_COMP (2 
bits). 

15 [0200] In the ATS_C_Usage, the data "0001 b" indicat- 
ing such usage as "a spotlight section" for highlighting 
(spotlighting) a specific portion of the audio manager 
menu (AMGM) displayed is written. 
[0201] When the ATS has AOTT_AOBS, the start 

20 address of ATS_C expressed in the relative logical block 
number counted from the first logical block in the 
AOTT_AOBS in which ATS_C has been recorded is 
written in the ATS_C_SA. 

[0202] On the other hand, when the ATS has no 
25 AOTT_AOBS, the start address of ATS_C expressed in 
the relative logical block number counted from the first 
logical block in the AOTT_VOBS in which ATS_C has 
been recorded is written in the ATS_C_SA. 
[0203] When the ATS has AOTT_AOBS, the end 
30 address of ATS_C expressed in the relative logical block 
number counted from the first logical block in the 
AOTT_AOBS in which ATS_C has been recorded is 
written in the ATS_C_EA. 

[0204] On the other hand, when the ATS has no 
35 AOTT_AOBS, the end address of ATS_C expressed in 
the relative logical block number counted from the first 
logical block in the VTSTT_VOBS in which ATS_C has 
been recorded is written in the ATS_C_EA. 
[0205] FIG. 28 is a diagram showing the contents 
40 of the audio title set audio still video playback 
information table (ATS_ASV_PBIT) shown in FIG. 
25. The ATS_ASV_PBIT includes audio title set 
program audio still video playback information 
search pointers (ATS_PG_ATS_PBLSRP #1 to 
45 ATS_PG_ASV_PBI_SRP #m) and audio title set audio 
still video playback information (ATS_ASV_PBI #1 to 
ATS_ASV_PBI #n). Here, n and m meets the expres- 
sion: n s m s 99. 

[0206] FIG. 29 is a table showing the contents of the 
so audio title set program audio still video playback infor- 
mation search pointers (ATS_PG_ASV_PBI_SRP). The 
ATS_PG_ASV_PBI_SRP includes the number 
(ASVUN) of audio still video units (ASVU), the display 
mode (ASV_DMOD) of audio still video (ASV), the start 
55 address (ATS_ASV_PBLSA) of audio title set audio still 
video playback information (AST_ASV_PBI), and the 
end address (ATS_ASV_PBI_EA) of audio title set 
audio still video playback information (ATS_ASV_PBI). 
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[0207] The ATS_ASV_PBI includes display lists 
(ASV.DLIST #1 to ASV_DLIST #k: k s 99) for audio still 
video (ASV). 

[0208] FIG. 30 is a block diagram of an apparatus (a 
DVD player) for reproducing the recorded information in 
the DVD audio zone 71 of FIG. 3 or the recorded infor- 
mation in the DVD video zone 72 of FIG. 5 from the opti- 
cal disk (DVD audio disk) 10 of FIG. I.The reproducing 
apparatus has the configuration of a DVD video/DVD 
audio compatible player capable of performing not only 
audio playback but also video playback. Explanation of 
the concrete configuration will not be given. The player 
may be compatible with the current CD playback. 
[0209] The reproducing apparatus for the optical disk 
10 shown in FIG. 30 includes a remote controller 5 that 
receives the user's operations, a remote controller 
receiving section that receives the operating state of the 
remote controller 5, a key input section 4 that receives 
the user's operations on the reproducing apparatus 
body side, and a panel display section 4B that is pro- 
vided on the reproducing apparatus body (or/and the 
remote controller 5) and informs the user of the result of 
the user's operation and the playback state of the opti- 
cal disk 10. The other external units include a monitor 
section 6 and a speaker section 8L/8R. The speaker 
section is for two-channel stereo. To effect multichannel 
playback, as many speaker systems and driving ampli- 
fiers for the speakers as are needed for multichannel 
playback must be prepared. 

[0210] The key input section 4, panel display section 
4B, remote controller 5, and monitor section 6 constitute 
a visual user interface. The monitor section 6 is used 
not only as a playback image monitor for still-picture- 
added DVD audio disks, but also as display means, 
such as an on-screen display (OSD). The monitor sec- 
tion 6 is not limited to a direct-view display, such as a 
CRT display, a liquid-crystal display, or a plasma display, 
and may be a video projector that projects various 
images (such as a menu screen, still pictures showing 
the state of the recording spot, and others) including the 
OSD information on a large screen. 
[0211] Information on the user's operations from the 
remote controller 5 is sent via a remote controller 
receiving section 4A to the microcomputer (MPU or 
CPU) 500 of a system control section 50 that controls 
the operation of the entire reproducing apparatus. The 
control section 50 includes a ROM 502 in which a con- 
trol program and others to be executed by the MPU 500 
have been stored. 

[0212] Information on the user's operations from the 
key input section 4 is sent directly to the MPU 500. The 
MPU 500 displays the operating state of the reproduc- 
ing apparatus (various setting states and playback infor- 
mation on the DVD disk) according to information on the 
user's operations on the panel display section 4B. 
[0213] A RAM 52 and a memory interface (memory 
l/F) 53 are connected to the MPU 500. The input/output 
control of the RAM 52 is carried out via the memory l/F 



53. The MPU 500 uses the RAM 52 as a work area. On 
the basis of various processing programs stored in the 
ROM 502, the MPU 500 controls the operations of a 
disk drive section 30, a system processor section 54, a 
5 video decoder section 58, an audio decoder section 60, 
a sub-picture decoder section 62, and a D/A converting 
& reproducing section 64. 

[0214] The disk drive section 30 not only rotates the 
optical disk 10 set in the tray (the inside of the DISK 

w TRAY INLET of FIG. 31) of the reproducing apparatus 
body, but also reads the recorded data (audio data 
including voice/music information and, if recorded on 
the optical disk 10, main picture data/video data includ- 
ing moving picture information/still picture information, 

15 and sub-picture data including subtitle informa- 
tion/menu information) from the optical disk 10. The disk 
drive section 30 subjects the read-out data to signal 
processes, including signal demodulation and error cor- 
rection, and converts the processed data into data 

20 strings in pack form (see FIGS. 4 and 6). The resulting 
data is sent to the system processor section 54. 
[0215] The system processor section 54 has a packet 
transferring section (not shown) that judges the types of 
various packets included in the data reproduced from 

25 the optical disk 10 and delivers the data items in the 
packet to the corresponding one of the decoders (58, 
60, 62). 

[0216] The packet transferring section segments the 
pack-form data string from the disk drive section 30 by 
30 the type of pack (such as, navigation pack, video pack, 
sub-picture pack, audio pack, or real-time information 
pack). An ID data item indicating the transfer time data 
item and the type of data is recorded in each of the seg- 
mented packs. 

35 [0217] Referring to the transfer time data item and the 
ID data item, the system processor section 54 transfers 
video packs, sub-picture packs, and video packs to the 
video decoder section 58, sub-picture decoder section 
62, and audio decoder section 68, respectively. An 

40 audio pack or a real-time information pack correspond- 
ing to a silent cell is sent to the audio decoder section 
60. 

[0218] The system processor section 54 transfers the 
control data in the navigation pack to the RAM 52 via 
45 the memory l/F 53. The MPU 500, referring to the trans- 
ferred control data in the RAM, controls the playback 
operation in each section of the reproducing apparatus 
body. 

[0219] The video decoder section 58 decodes the 
so video data MPEG-encoded in the video pack trans- 
ferred from the system processor section 54 and cre- 
ates the uncompressed image data. 
[0220] The sub-picture decoder section 62 decodes 
the sub-picture data run-length-compressed in the sub- 
55 picture pack transferred from the system processor sec- 
tion 54 and creates the uncompressed bit map sub-pic- 
ture data. The sub-picture decoder section 62 includes 
not only a sub-picture decoder for decoding the sub-pic- 
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ture data from the system processor section 54 but also 
a highlighting section (in the case of DVD video) for 
decoded sub-picture data. ' 

[0221] The sub-picture decoder expands the pixel 
data (including highlighted pixels, pattern pixels, and 5 
background pixels) in units of a specific number of bits 
(two bits) run-length-compressed according to a spe- 
cific rule and restores the original bit map image. 
[0222] The highlighting section performs a corre- 
sponding highlighting process according to the X-Y 10 
coordinate values indicating the rectangular area in 
which the highlight information (e.g., the choices on a 
menu), color codes, and highlight color/contrast value 
supplied from the MPU 500. 

[0223] The highlighting process can be used as 15 
means for enabling the user to easily recognize a dis- 
played specific item (the operator for selecting the type 
of reproduced spoken language and the type of lan- 
guage used in reproduced subtitles or the operator for 
selecting a specific item, such as the sampling fre- 20 
quency of the reproduced sound, the number of quanti- 
zation bits, or the number of playback channels) on a 
visual user interface on the monitor section 6. 
[0224] When the color and contrast of each pixel in the 
decoded sub-picture data are changed according to the 25 
highlight information, the changed sub-picture data is 
supplied to the image combining section (not shown) of 
a video processor section 640. The image combining 
section combines the decoded image data with the 
highlighted sub-picture data. The resulting image is dis- 30 
played on the monitor section 6. 
[0225] The RAM 52 includes a menu table for storing 
the start addresses of a sub-picture menu, an audio 
menu, an angle menu, and a chapter (program) menu. 
To highlight a specific part of these menus, the highlight- 35 
ing process is used. 

[0226] The audio decoder section 60 decodes the 
audio data in the audio pack transferred from the sys= 
tern processor section 54 and creates audio data for 
two-channel stereo or multichannel stereo. When the 40 
audio data in the audio pack is compression-encoded 
data (such as MPEG or AC-3), the audio decoder sec- 
tion 60 also decodes the data. 

[0227] The image data (normally, moving-picture sig- 
nals) decoded by the video decoder section 58 and the 45 
sub-picture data (normally, the bit map data on subtitles 
and menus) decoded by the sub-picture decoder sec- 
tion 62 are transferred to a video processor 640. The 
video processor 640 mixes the image data- and the 
sub-picture data in a specific ratio to produce the final so 
analog image signals (composite video signals, sepa- 
rate S signals, or component signals Y/Cr/Cb) and out- 
puts these signals to the monitor section 6. 
[0228] When the image data decoded by the video 
decoder section 58 is the main part of the movie on a 55 
DVD video disk, the sub-picture data is usually the sub- 
titles in the language selected by the user. The monitor 
section 6 displays the main part of the movie with subti- 



tles. 

[0229] When the image data decoded by the video 
decoder section 58 is the menu section of the movie, 
the sub-picture data serves as the characters constitut- 
ing menus and user select operators (subjected to the 
highlighting process, when necessary). In this case, the 
background (still picture or moving picture) of the menu 
is displayed according to the image data and the opera- 
tors whose representations are changed by the user's 
select operation are displayed on the background 
screen according to the sub-picture data. 
[0230] On the other hand, when the image data 
decoded by the video decoder section 58 is a still pic- 
ture on a DVD audio disk, the sub-pictuce data is, for 
example, an explanatory text in the language selected 
by the user. In that case, the still picture with text is dis- 
played on the monitor section 6. 
[0231] The video processor section 640 includes an 
OSD section that generates display data for on-screen 
display. The user's operations from the remote control- 
ler 5 or the like are processed by the MPU 500. The 
result of the processing is sent from the MPU 500 to the 
OSD section of the video processor 640. The OSD sec- 
tion generates image data corresponding to the result of 
processing from the MPU 500 and sends the image 
data to the monitor section 6 in analog image signal 
form. 

[0232] In other words, the video processor section 640 
converts the digital signals from the video decoder sec- 
tion 58 and sub-picture decoder section 62 into analog 
signals and multiplexes them. 

[0233] A frame memory section 642 is connected to 
the video processor section 640. The frame memory 
section 642 is used not only to multiplex the pictures of 
the image data and the pictures of the sub-picture data 
but also to provide an n-partition (e.g., 4-partition) multi- 
screen display. 

[0234] When chapter searching is done, the frame 
memory section 642 can fix part of the images from the 
video decoder section 58 as still pictures and use them 
when sending the still pictures to the monitor section 6 
until the target chapter starts to be reproduced. 
[0235] When the display corresponding to the result of 
the user's operation is made by the OSD, the frame 
memory section 642 can be used in multiplexing the 
image data with the OSD display. 
[0236] The audio data decoded at the audio decoder 
section 60 is transferred to a DAC & output circuit 644. 
The DAC & output circuit 644 converts the audio data 
(digital) from the audio decoder section 60 into the cor- 
responding analog audio signal, amplifies the analog 
audio signal suitably, and outputs it to the speaker sec- 
tion 8L/8R. 

[0237] When the multichannel audio is down-mixed to 
two channels on the basis of the contents of 
ATS_DM_COEFT in the audio title set information man- 
agement table (ATS l_ MAT) shown in FIG. 24, the MPU 
500 sends the down-mix coefficient (parameter) to the 
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DAC & output circuit 644. Then, on the basis of the coef- 
ficient received, the DAC & output circuit 644 down- 
mixes the multichannel audio data decoded at the audio 
decoder section 60 into two channels and outputs two 
channel analog audio signals. 

[0238] The video processor section 640, frame mem- 
ory section 642, and DAC & output circuit 644 constitute 
the D/A converting/reproducing section 64. 
[0239] Each of the system processor section 54, video 
decoder section 58, audio decoder section 60, and sub- 
picture decoder section 62 includes a register for tem- 
porarily storing a system time clock (STC) used to know 
the operation timing and the instructions and pieces of 
information from the system MPU 500. 
[0240] FIG. 31 shows an example of the front panel of 
the reproducing apparatus shown in FIG. 30. The front 
panel is provided with a fluorescent display section (FL 
display) 4B corresponding to the panel display section 
4B shown in FIG. 30. 

[0241] On the FL display 4B of FIG. 31, an album 
name and/or a group name are displayed in characters 
according to the audio text data manager ATXTDT_MG 
in the AMGI. Taking FIG. 8 as an example, "THE FIRST 
VOLUME OF WORKS BY BEETHOVEN" is displayed 
as an album name and "SYMPHONY NO. 1" is dis- 
played as a group name. 

[0242] On the left numeral display section of the FL 
display 4B, a title number (in the case of DVD video) or 
group number (in the case of DVD audio), a track 
number, and index number are displayed. 
[0243] When the optical disk 10 set in the disk tray of 
FIG. 31 is an AV disk (a disk with ATT.SRP of FIG. 19), 
the "AV DISK" part is highlighted as shown in the figure 
on the character display section somewhat in the middle 
and on the right side of the FL display 4B. When the 
optical disk 10 set is an A disk (a disk with AOTT_SRT 
of FIG. 21), the "A DISK" part is highlighted on the right 
character display section of the FL display 4B. When the 
optical disk 10 set is a video disk with only VTS and no 
ATS (a disk without the ATS directory of FIG. 11), the 
"VIDEO DISK" part is highlighted on the right character 
display section of the FL display 4B. 
[0244] Furthermore, on the right numeral display sec- 
tion of the FL display 4B, the sampling frequency and 
the number of quantization bits in the-audio contents to 
be reproduced are displayed. The display can be made 
automatically on the basis of AOTT_ AO B_ ATR or 
AOTT_VOB_ART in the audio title set information man- 
agement table ATS L MAT. 

[0245] The following two types of DVD audio player 
that plays back a DVD audio disk (A disk or AV disk) can 
be considered: 

(C1 ) Player capable of reproducing images and 
sound complying with the audio standard. 
<C2) Player capable of reproducing only sound 
complying with the audio standard. 



[0246] The <C1 )-type player has only to read the 
search information (FIG. 19) written in the ATT_SRPT 
for playback of contents. 

[0247] On the other hand, the < C2 Mype player has 

5 only to read the search information (FIG. 21) written in 
the AOTT.SRPT for playback of contents. 
[0248] By doing this, the reproducing method on each 
type of player is simplified. Because the <C2)-type 
player cannot reproduce ATT #1 and ATT #9 of FIG. 22 

10 because they have no AOTT. 

[0249] The reproducing apparatus of FIG. 30 is a 
player of the <C1 > type. The operation of the player 
playing back the optical disk 10 with the data structure 
of FIG. 13 will be explained. 

15 [0250] When the optical disk 1 0 with the data structure 
of FIG. 13 is played back on an ordinary DVD video 
player, the video player reads the VMG in the VTS direc- 
tory under the root directory of FIG. 1 1 and, on the basis 
of the information, determines the title to be repro- 

20 duced. Then, according to the instruction given by the 
playback unit defined in the VTS corresponding to the 
determined title, all of or part of the object set (VOBS #1 
or VOBS #2) of FIG. 13 is reproduced. 
[0251] In the data structure of FIG. 13, the video 

25 player recognizes the parts excluding VMG, VTS #1, 
and VTS #2 as the other recording areas 73 (see FIGS. 
3 and 5). Therefore, no matter what type of data has 
been written in the parts recognized as the other record- 
ing areas 73, this has no effect on the video player 

30 reproducing VOBS #1 and VOBS #2. In this case, the 
video player cannot reproduce the objects present in the 
other recording areas 73. 

[0252] On the other hand, when the optical disk 10 
with the data structure of FIG. 13 is played back on a 

35 DVD audio player of FIG. 30, the audio player reads the 
AMG in the ATS directory under the root directory of 
FIG. 1 1 and reproduces the contents on the basis of the 
information. In title specification using AMG, not only 
the playback unit defined in the ATS recorded in the 

40 DVD audio zone 71 (see FIG. 3) but also the playback 
unit defined in the VTS recorded in the DVD video zone 
72 (see FIG, 5) can be specified. 
[0253] The playback unit defined in the ATS can spec- 
ify not only the playback route of the object (AOBS #1 or 

45 AOBS #2) recorded in the DVD audio zone 71 but also 
the playback route of the audio data recorded in the 
object (e.g., VOBS #1) in the DVD video zone 72. 
[0254] The VOBS #1 marked with slanted lines in FIG. 
13 indicates part of the DVD video shared by the DVD 

50 audio side. Here, the arrow (A) indicates the case where 
the playback unit in the DVD video zone 72 is referred 
to. The arrow (B) indicates the case where the playback 
unit in the DVD audio zone 71 refers to the audio part of 
the object (VOBS #1) in the DVD video zone 72. 

55 [0255] When the audio part of the object (VOBS #1 ) in 
the DVD video zone 72 is referred to by the playback 
unit in the DVD audio zone 71, the common reference 
part (the part shared by the DVD audio and DVD video) 
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can be defined differently from each unit (such as cells, 
programs, or program chains) defined by the definition 
information (VTSI) in the playback unit in the DVD video 
zone 72, on the basis of the definition information (ATSI) 
in the playback unit. This is because the video player 
may differ from the audio player in the reproducing 
method, although the objects are the same (see FIG. 7). 
[0256] The shared part is used by using a video object 
unit VOBU as a unit. The reason is that the unit in which 
an audio data stream and other (video and sub-picture) 
data streams are each packed and time-division-multi- 
plexed is a VOBU. 

[0257] As shown in FIG. 13, by placing the DVD audio 
zone 71 physically in front of the DVD video zone 72, all 
the addresses of the playback units specified in the indi- 
vidual pieces of the management information can be 
limited to only ascending address specification. This 
simplifies the design and development of audio players. 
[0258] The operation of a video player in the data 
structure of FIG. 1 5 is the same as in FIG. 1 3. The oper- 
ation of an audio player in the data structure of FIG. 15 
is almost the same as in FIG. 13. The audio player 
jumps to the head of the AMG, reads the management 
information, and reproduces the audio object sets 
(AOBS #1 and AOBS #2). The AOBS #1 is an object in 
the DVD video zone 72. Using ATSI #1 , cells, programs, 
and program chains in AOBS #1 are defined again. In 
AOBS #1, VOBU is used as a unit. 
[0259] In the above embodiment, the case where the 
DVD audio data and/or DVD video data included in the 
volume space is recorded on the optical disk 10 has 
been explained. However, the data structure of the 
present invention (see FIGS. 3 to 29) is not limited to the 
case where the data is recorded on the optical disk 10. 
For instance, bit streams including the data having the 
structures shown in FIGS. 3 and 1 1 may be used in dig- 
ital broadcasting or digital communications. In this case, 
electromagnetic waves or communication lines function 
as mediums. Moreover, communication terminals, such 
as DVD broadcasting receivers, or personal computers, 
function as DVD audio players. 
[0260] Systems to which the present invention is appli- 
cable have been generally explained. The points a 
stress is laid on in the invention will be explained in 
order. 

[0261] The main point is that the cell structure in DVD 
audio is given a characteristic. First, there are the fol- 
lowing two types of DVD audio, depending on the type 
of data dealt with: 

[A-1] Audio with Video: a system that handles both 
audio data and video data. 
[A-2] Audio without Video: a system that handles 
only audio data and deals with no video data. 

[0262] The data structure of type [A-1] is basically the 
same as that in the DVD standard. What the present 
invention deals with is related to the audio data struc- 



ture of type [A-2]. The optical disk 10 for the DVD audio 
system has the structure as explained earlier. All of the 
one side of the optical disk 10 is defined as a volume. A 
title group (TT_GR) is a component element of a vol- 
5 ume and composed of one or more audio titles (ATT). 
The TT_GR compares to an album in a record or a CD. 
It is assured that track groups in one TT_GR can be 
reproduced continuously. 

[0263] There are the following two types of ATT: 

10 

[B-1] Audio with Video Title (AVTT): a title made up 
of audio data and video data. 
[B-2] Audio Only Title (AOTT): a title made up of 
only audio data. 

15 

[0264] The AVTT and AOTT are generally called ATT. 
As described above, since the present invention deals 
with the data structure of type [A-2], explanation of 
AOTT will be given. One AOTT is made up of one PGC. 

20 More specifically, as shown in FIG. 3, one AOTT is com- 
posed of the program chain information (ATS_PGCI) in 
the ATS and one or more cells in the audio object set 
(AOBS) in the corresponding ATS. 
[0265] A track is a program (PG) defined in the PGC. 

25 One track is composed of one PG. The track is made up 
of one or more cells. 

[0266] Generally, in the audio contents, a track is used 
as a unit in separating pieces of music. A cell is used as 
a unit in separating the numbers in a piece of music. 
30 The playback of the audio contents is defined by speci- 
fying the playback sequence of cells. 
[0267] The following specifications are required for the 
audio data structure of type [A-2]: 

35 [C-1] The attribute of audio data has to be able to 
be set track by track. 

[0268] About [C-1]: in a music CD, the attributes 
(including the sampling frequency fs and the number of 

40 quantization bits Qb) of each piece of music in one 
album are all the same. In DVD audio, however, 
attributes are allowed to be set piece by piece to 
increase the degree of freedom of the sound source. 
Specifically, the content provider can set attributes track 

45 by track. The attributes for each track in DVD audio 
include sampling frequency, the number of quantization 
bits, channel assignment, and down-mix coefficient. 
[0269] As described above, when the audio data ful- 
filling the specification in item [C-1] is reproduced on a 

so DVD player, there arises a sound break problem at the 
start of playback of tracks. However, from the viewpoints 
of contents, the sound break should be managed by the 
producer. Moreover, as described above, the length of a 
sound break should be the same, regardless of a player 

55 with or without a video reproducing function. 

[0270] Therefore, according to the present invention, 
there is provided a data structure which enables the 
producer to set the length of sound break time by him- 
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self and which enables a player to determine whether a 
sound break is present or not in the playback procedure 
defined by the producer and realizes the setting of the 
sound break time set by the producer. 
[0271] A sound break may occur in a place where the 
attributes of two tracks making a playback transition dif- 
fer from each other. Specifically, when the player repro- 
duces a track with an attribute and the attribute of the 
next track to be reproduced differs from the preceding 
one, the player has to make various settings [including 
the setting of buffers affected due to the difference in the 
number of quantization bits, the setting of the clock 
(sampling) frequency, and the setting of the number of 
channels). During the settings, the data transfer is 
stopped and therefore a break in the sound takes place. 
Naturally, the sound break does not occur when the 
attributes of two tracks making a playback transition are 
the same. Consequently, a sound break may or may not 
take place in one title group (TT_GR). 
[0272] Since the sound break is ascribed to a physical 
cause, it cannot be solved in terms of the application- 
level data structure. Therefore, the system of the 
present invention positively admits the existence of a 
sound break described above and constructs such a 
data structure as allows the content provider to manage 
the sound break time length. The resulting form gives 
no unnatural feeling to the user when it is reproduced. 
[0273] The types of cells of audio data are defined as 
follows: 

[D-1] Audio cell (A_C): a cell composed of ordinary 
audio data. 

[D-2) Silent cell (SI_C): a cell composed of silent 
audio data. 

[0274] Then, the data identifying information to iden- 
tify the contents of cell components is added to the cell 
information, thereby making it possible to discriminate 
between the two types of cells. Here, silent means not 
that audio data does not exist, but that there is audio 
data with an amplitude of zero. 
[0275] FIG. 32 shows the classification of two types of 
cells. 

[0276] An audio cell (A_C) in a audio data cell 
includes no still picture data. A silent cell (SI_C) corre- 
sponds to a special case in an audio data cell. All the 
audio data items in the silent cell are silent. The silent 
cell is used to set and manage the length of time of 
silence. 

[0277] One ATS_PG is composed of one or more 
ATS_C. The ATS_PG corresponds to a track and the 
ATS_C corresponds to a cell. 

[0278] FIGS. 33A and 33B show two ways in which 

ATS_C are arranged in the ATS_PG. 

[0279] Specifically, in [E-1] ATS_PG, only A_C are 

arranged. 

[0280] In [E-2] ATS_PG, the first cell is SI_C and the 
second and later cells are all A_C arranged in 



sequence. 

[0281] All the ATS_C constituting one ATS_PG meet 
the following conditions: 

5 [F-1] All the ATS_C constituting one ATS.PG are 

physically consecutive in the arrangement. 
[F-2] The presentation time stamps (PTS) in all the 
ATS_C constituting one ATS_PG are consecutive. 
[F-3] At least one A_C exists in one ATS_PG. 

10 [F-4] The presentation time of one A_C is one sec- 
ond or longer. 

[F-5) The audio attributes of all the SI_C constitut- 
ing one ATS_PG and those of the A_C group are 
the same. 

15 [F-6] The presentation time of one SI_C is 0.5 sec 
or longer. 

[0282] FIGS. 34, 35, and 36 are diagrams to help 
explain AOTT_AOB and AOTT_AOBS. In the figures, 
20 A_PAK means an audio pack, and PTI_PAC means a 
real-time information pack. 

[0283] As described above, the target title is AOTT of 
the type described in item [B-2]. Therefore, an audio 
object, the substance of the data, is an Audio Object for 
25 Audio Only Title (AOTT_AOB). The AOTT_AOB is com- 
posed of one or more ATS_C. Each ATS_C is com- 
posed of pack groups. 

[0284] The data included in the AOTT_AOB is audio 
data. The audio data includes silent audio data (with an 

30 amplitude level of zero as described earlier). It also 
includes a little additional data of non-image [this is 
called real-time information data], such as text data, as 
a special example in the form of RTI packs. 
[0285] The AOTT_AOB has to include audio data. All 

35 the attributes of the audio data in one AOTT_AOB have 
to be the same. The still picture data is included option- 
ally in the AOTT_AOB. The still picture included in one 
program (PG) has to be outputted before the audio data 
in the program is reproduced. 

40 [0286] One AOTT.AOB is one program stream or part 
of the stream written according to the system part 
of the MPEG-2 standard (ISO/IEC 13818-1). The 
AOTT_AOBS is a collection of AOTT_AOB. As defined 
in [D-1] and [D-2], the following two types of cells are 

45 defined in one AOTT_AOB. 

[G-1] An audio cell (A_C) is composed of only audio 
data packs groups (see FIG. 34) or of audio data 
pack groups and additional non-image data (RTI 

so data) pack groups (see FIG. 35). Its presentation 
time is one second or longer. 
[G-2] A silent cell (SI_C) is composed of only silent 
audio data pack groups (FIG. 36) and is used to set 
a silent period. The presentation time for one SI_C 

55 is 0.5 second or longer. 

[0287] The following two cases can be considered in 
connection with the relationship between two adjacent 
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PGs on the PGC: 

[1-1] A PG and the preceding PG have the same 
attributes, 

[I-2] A PG and the preceding PG have different 
attributes. 

[0288] In the PG, the attributes of the audio data the 
cells in the PG have and the PG information to define 
the temporal relationship with the preceding PG are 
written. By recognizing the contents and information on 
the cell arrangement (playback sequence) in the PG, 
two states in item [1-1] and item [I-2] can be recognized 
easily on a player. 

[0289] FIGS. 37A to 37C and FIGS. 38A to 38C show 
the relationship between PTS and playback time in two 
adjacent PGs in the two cases described in item [1-1] 
and [I-2]. 

[0290] FIGS. 37A to 37C show a case in item [1-1]. 
FIGS. 37A, 37B, and 37C illustrate the arrangement 
(PG) of audio packs on a track, the values of presenta- 
tion time stamps (PTS), and the passage of playback 
time, respectively. In this case, the consecutive transfer 
of audio data streams is maintained and the continuity 
of audio playback is also maintained. The PTS is reset 
at the beginning of the PG. 

[0291] In this case, the first cell in the PG might be 
SI_C but the state remains unchanged. The reason is 
that SI_C is a kind of audio cell and corresponds to a 
special case where all the audio data items have an 
amplitude level of zero. 

[0292] FIGS. 38A to 38C show a case in item [I-2]. 
FIGS. 38A, 38B, and 38C illustrate the arrangement 
(PG) of audio packs on a track, the values of presenta- 
tion time stamps (PTS), and the passage of playback 
time, respectively. In this case, too, the consecutive 
transfer of audio data streams is maintained. Since 
hardware resetting is necessary in changing the 
attributes, audio playback is discontinued during the 
resetting. 

[0293] In this case, the content producer can manage 
the intervals of silent time by setting the first celt in the 
PG to SI_C. Specifically, the content producer can set 
the intervals of silent time at will by setting the length of 
SLC (0.5 sec or longer). 

[0294] Although PTS are written discontinuously 
between adjacent PGs in FIGS. 37A to 37C and FIGS. 
38A to 38C, the PTS may be continuous because the 
audio data streams in both PGs are continues. 
[0295] The two types of cells will be described system- 
atically as follows. First, as shown in FIG. 3, the DVD 
audio zone 71 is composed of a simple audio manager 
(SAMG) file, an audio manager (AMG) file, an audio still 
video set (ASVS) file, and an audio title set (ATS) file. 
The audio title set (ATS) is composed of an audio title 
set information (ATSI) file, an audio-only title audio 
object set (AOTT_AOBS) file, and an audio title set 
information backup (ATS_BUP) file. 



[0296] As shown in FIG. 23, the audio title set informa- 
tion (ATSI) is composed of an audio title set information 
management table (ATS.MAT) file and an audio title set 
program chain information table (ATS_PGCIT) file. 

5 [0297] The ATS_PGCIT is composed of an audio title 
set program chain information table information 
. (ATS_PGCITI) file, an audio title set program chain 
information search pointer (ATS_PGCI_SRP) file, and 
an audio title set program chain information 

w (ATS.PGCI) file. 

[0298] As shown in FIG. 25, the ATS_PGCI is com- 
posed of an audio title set program chain general infor- 
mation (ATS_PGC_GI) file, an audio title set program 
information table (ATS.PGIT) file, an audio title set cell 

15 playback information table (ATS_C_PBIT) file, and an 
audio title set audio still video playback information table 
(ATS_ASV_PBIT) file. 

[0299] An item is set as a variable in each piece of 
audio title set cell playback information (ATS_C_PBI) 
20 written the audio title set cell playback information table 
(ATS_C_PBIT). The variable is the audio title set ceil 
type (ATC_C_TY) shown in FIG. 27. This specifies 
which one of the following items the cell falls under: 

25 [J-1] An audio cell (A_C) composed of only audio 
data. 

[J-2] An audio cell (A_C) composed of audio data 
and real-time information. 

[J-3] A silent cell (SI_C) composed of only silent 
30 audio data with an amplitude level of zero. 

[0300] Furthermore, "00h" is specified for variable 
(ATC_C_IXN) when the cell is SI_C. When the cell is 
A_C, the index number (in the range of 1 to 99) of the 
35 cell is specified. 

[0301 ] Because the player knows the types of the indi- 
vidual cells from these pieces of information, it can rec- 
ognize the presence or absence of a break in sound 
beforehand. 

40 [0302] FIG. 39 shows a reproducing apparatus for 
playing back the DVD audio disk. Because the repro- 
ducing apparatus is a unit for reproducing only audio 
data, it has no system for processing video data and 
sub-picture data, as compared with the apparatus of 

45 FIG. 30. When a disk on which image data has been 
recorded is played pack, the reproducing apparatus 
simply ignores the image data periods. 
[0303] Specifically, even when the image data has 
arrived, the system processor section 54 does not 

so transfer the data to the audio decoder section 60. When 
the silent cell data has arrived, it transfers the data as 
audio data to the audio decoder section 60. The remain- 
ing sections are almost the same as those in FIG. 30. 
[0304] Although in the above embodiment, the image 

55 data has been ignored completely, a terminal 54-1 for 
separating and extracting only the image data may be 
provided on the system processor section 54. With this 
configuration, the user can use a disk with images that 
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he or she bought, by, for example, playing back the 
audio disk at home and supplying the image data to the 
decoder input terminal of a DVD player. An audio output 
terminal may, of course, be added. 
[0305] With this reproducing apparatus, when the 
real-time information has been recorded on a disk, the 
data can be demodulated at the system control section 
50 or a separately provided demodulating section and 
displayed on the panel display section 4B. In this case, 
it is- desirable that the panel display section 4B should 
have, for example, a liquid-crystal screen. Various keys, 
including a ten-key pad, are used as the key input sec- 
tion 4. 

[0306] Either type of disk playback apparatus must 
have the function of identifying the cell type. 
[0307] FIG. 40 shows a reproducing apparatus capa- 
ble of playing back a disk on which images have been 
recorded. When image data has been recorded on the 
disk, the reproducing apparatus reproduces the data 
and displays it on the monitor 6. When a disk of another 
type is played back, the reproducing apparatus oper- 
ates as the apparatus of FIG. 39 does. 
[0308] While the present invention has been explained 
using the recording medium and the disk playback 
apparatus, it may be applied to a case where the audio 
information defined as described above is transmitted 
via a transmitting unit and received by a receiving unit 
and thereafter reproduced. Furthermore, the invention 
may be applied to a case where a control signal for real- 
izing the function of receiving and processing the afore- 
mentioned audio information is transmitted to a 
receiving unit and thereafter the audio information 
defined as described above is read from a transmitting 
or recording medium and reproduced. 

Industrial Applicability 

[0309] As described above, the present invention pro- 
duces the following effect. The audio attributes can be 
specified track by track. This causes a sound break 
problem, which is ascribed to a break in the necessary 
time for resetting the hardware environment of the 
player and in the audio output as a result of the change 
of the audio attributes. 

[0310] To solve this problem, audio cells and silent 
cells have been defined and the arrangement of them 
been limited. The introduction of such a concept ena- 
bles the content producer to positively manage and set 
the sound break time. For example, when tracks with 
sound breaks mingle with tracks with no sound break, 
silent periods of time can be standardized in any track 
by placing a silent cell at the head of each track with no 
sound break. This prevents the mixture of tracks with 
and without sound breaks from giving unnatural feeling 
to the user. 



Claims 

1. An audio data structure characterized in that, in 
audio contents which have cells for defining at least 

5 an audio title playback unit and whose actual play- 
back sequence is determined by defining the play- 
back sequence of the cells, identification 
information to specify the types of the cells accord- 
ing to the difference in the contents of the data 

10 included in said cells is included in cell information 
to specify said cells. 

2. The data structure according to claim 1 , character- 
ized in that one type of the contents of the data in 

15 said cells is for obtaining the length of a silent 
period of time and the identification information cor- 
responding to the cell indicates a silent cell. 

3. The data structure according to claim 1, character- 
20 ized in that said identification information defines 

audio cells composed of ordinary audio data as 
a first type, and 

silent cells composed of only audio data with 
25 an amplitude level of zero as a second type, 

and 

identifies the types of the cells. 

4. An audio data structure characterized in that, in a 
30 track that defines the playback sequence of one or 

more cell groups, 

an arrangement of audio cells composed of 
ordinary audio data items is allowed as a first 

35 arrangement, and 

an arrangement of silent cells composed of 
only audio data items with an amplitude level of 
zero and audio cells composed of ordinary 
audio data items is allowed as a second 

40 arrangement. 

5. An audio data structure characterized in that, in a 
track that defines the playback sequence of one or 
more cell groups, 

45 

the audio attributes of the cells constituting said 
track and track connection information indicat- 
ing the relationship in playback time between 
the present track and the preceding track are 
so provided between tracks. 

6. The data structure according to claim 5, character- 
ized in that said audio attributes include at least 
sampling frequency, the number of quantization 

55 bits, channel assignment, and down-mix coeffi- 
cient. 

7. The data structure according to claim 3, character- 
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ized in that the presentation time for said audio cells 
is set to one second or longer. 

8. The data structure according to claim 3, character- 
ized in that the presentation time for said silent cells 5 
is set to 0.5 second or longer. 

9. A disk playback apparatus characterized by com- 
prising 

10 

an information recording medium which has 
cells for defining at least an audio title playback 
unit, audio contents whose actual playback 
sequence is determined by defining the play- 
back sequence of the cells, and an audio data 15 
. structure where identification information to 
identify the types of the cells according to the 
difference in the contents of the data included 
in said cells is included in cell information to 
specify said cells, with at least silent data exist- 20 
ing in said cells and said identification informa- 
tion including silent cell identification 
information to identify said silent data cell, and 
means for reading the recorded information on 
said information recording medium and, when 25 
recognizing said silent cell identification infor- 
mation, supplying the audio contents inte- 
grated in the identification information to an 
audio decoder section. 

30 

10. The disk playback apparatus according to claim 9, 
characterized in that said identification information 
defines audio cells composed of ordinary data as 
cells of a first type and said silent cells composed of 
only audio data with an amplitude level of zero as 35 
cells of a second type and discriminates between 
these cells on the basis of cell time information, and 

said playback apparatus includes means for 
judging said cell type information and, when 40 
judging that picture cells have been inputted, 
ignoring the input and, when judging that silent 
cells and audio cells have been inputted, sup- 
plying the data in these cells to an audio 
decoder for playback. 45 

11. A transmission apparatus characterized by trans- 
mitting such information as includes audio contents 
which have cells for defining at least an audio title 
playback unit and whose actual playback sequence so 
is determined by defining the playback sequence of 
the cells, and causes identification information to 
identify the types of the cells according to the differ- 
ence in the contents of the data included in said 
cells to be included in cell information to specify 55 
said cells, one type of the contents of the data in 
said cells being for obtaining the length of the silent 
period of time and the identification information cor- 



responding to the cell indicating a silent cell. 

12. The transmitting apparatus according to claim 11, 
characterized in that the period in which said silent 
cell is transmitted is set between programs. 

13. A receiving apparatus characterized by comprising 
means which receives such information as includes 
audio contents which have cells for defining at least 
an audio title playback unit and whose actual play- 
back sequence is determined by defining the play- 
back sequence of the cells, and causes 
identification information to identify the types of the 
cells according to the difference in the contents of 

. the data included in said cells to be included in cell 
information to specify said cells, one type of the 
contents of the data in said cells being for obtaining 
, the length of the silent period of time and the identi- 
fication information corresponding to the cell indi- 
cating a silent cell and which, when recognizing the 
identification information indicating said silent cell, 
supplies the cell integrated in the identification infor- 
mation to an audio decoder section for playback. 

14. A recording medium on which audio contents are to 
be recorded, the audio contents having cells for 
defining at least an audio title playback unit and 
making actual playback sequence determined by 
defining the playback sequence of the cells, said 
recording medium characterized by recording iden- 
tification information to identify the types of the cells 
according to the difference in the contents of the 
data included in said cells in such a manner that the 
identification information is included in cell informa- 
tion to specify said cells. 

15. The recording medium according to claim 14, char- 
acterized in that one type of the contents of the data 
in said cells is for obtaining the length of the silent 
period of time and the identification information cor- 
responding to the cell indicates a silent cell. 

16. The recording medium according to claim 14, char- 
acterized in that said identification information 
defines audio cells composed of ordinary data as 
cells of a first type, and 

silent cells composed of only audio data with 
an amplitude level of zero as cells of a second 
type and identifies the types of the cells. 

17. A recording medium characterized in that, in a track, 
defining the playback sequence of one or more cell 
groups, 

an arrangement of audio cells composed of 
ordinary audio data items is allowed as a first 
arrangement, and 
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an arrangement of silent cells composed of 
only audio data items with an amplitude level of 
zero and audio cells composed of ordinary 
audio data items is allowed as a second 
arrangement for recording. 5 



18. A recording medium characterized in that, in a track 
defining the playback sequence of one or more cell 
groups, 

10 

the audio attributes of the cells constituting said 
track and track connecting information indicat- 
ing the relationship in playback time between 
the present track and the preceding track are 
recorded between tracks. 15 



19. The recording medium according to claim 18, char- 
acterized in that said audio attributes include at 
least sampling frequency, the number of quantiza- 
tion bits, channel assignment, and down-mix coeffi- 20 
cient. 

20. The recording medium according to claim 16, char- 
acterized in that the presentation time of said audio 
cells is set at one second or longer. 25 



21. The recordinq medium according to claim 16, char- 
acterized in that the presentation time of said silent 
cells is set at 0.5 second or longer. 

30 
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